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(54) Title: METHODS AND REAGENTS FOR MOLECULAR CLONING 

(57) Abstract: The present invention provides composidons, methods, and kits for covalently linking nucleic acid molecules. The 
methods include a strand invasion step, and die compositions and kits are usefiil forperfonning such methods. For example, a method 
of covalently linking double stranded (ds) nucleic add molecules can include contacting a first ds nucleic acid molecule, which has 
a topoisomerase linked to a 3* terminus of one end and has a single stranded 5' oveiiiang at the same end, with a second ds nucleic 
add molecule having a blunt end , such that the 5* overhang can hybridize to a coroplementaiy squence of the blunt end of the second 
nucleic acid molecule, and the topoisomerase can covalently link the ds nucleic acid molecules. The methods are simpler and more 
effidoit than previous methods for oovalendy linking nucldc add sequences, and the composidons and kits facilitate practising the 
methods, including methods of directionally finking two or more ds nucldc add molecules. 
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METHODS AND REAGENTS FOR MOLECULAR CLONING 

BACKGROUND OF THE PJVENTION 

FIELD OF THE flVVENTION 

The invention relates gmerally to compositions and metiiods for fecilitating fhe 
construction of recombinant nucleic acid molecules* and more specifically to 
compositions usefiil for covalmtiy linking two or more nucleic acid molecules, including 
for directionally or non-directionally linking the nucleic acid molecules, and to metiiods 
5 of generating such covalently linked recombmant nucleic acid molecules. 

BACKGROUND INFORMATION 

The ability to clone large numbers of nucleotides sequences, including gene 
sequences and open reading frames allows a great deal of information to be obtained 
about gene expression and the regulation thereoE In addition, such sequences can be 

10 usefld for imderstanding the etiology of disease conditions and, ideally, can provide a 
means to diagnose and treat such diseases. However, while it is relatively simple matter 
to clone large numbers of expressed nucleotides sequences, for example, it is a more 
difficult undertaking to characterize tiie regulatory elements involved in tiie expression of 
such sequence and to properly express a polypeptide encoded by the sequence. In 

15 particular, th^ is a need for inq>roved methods for ligating nucleic acid molecules and 
cloning nucleic acid molecules such that a functional recombinant nucleic acid molecule 
is produced There is a particular need for directional cloning methods, wherein an insot 
can be cloned into a vector or linked to one or more othor nucleic acid molecules in a 
predetermined orientation. 

20 The use of topoisomerases provides a convenient means to improve cloning and 

ligation methods. For example, the use of topoisomerase to perform rapid ligation of 
polymerase chain reaction (PGR) products into a vector has cut traditionally laborious 
cloning methods down to a five minute procedure. As such, topoisomerase is particularly 
usefld for high throughput cloning supplications. However, given the current demand for 

25 expressing open reading frames (OKF) in gnome scale molecular cloning procedures, 
fhore still remains a need to better control the orientation in which two or more nucleic 
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acid molecules are linked such that functional recombinant nucleic acid molecules such as 
expressible cloned nucleic acid molecules can be prepared 

Expression of cloned OSFs demands that the PGR product be insoted into the 
vector in its correct orientation, so as to work in accord wilfa functional expression 
5 domains located on the vector, hx the current state of the art for topoisomerase mediated 
cloning, ORFs are amplified by PGR using various DNA polymerases. A polymerase 
such at Taq, which does not have a proof-reading function and has an inherrat terminal 
transferase activity, is commonly used, and produces PGR products containing a single, 
non-tOTiplate derived 3' A overhang at each end. These anq)lification products can be 

10 eflBciently cloned into topoisomerase-modified vectors containing a single 3' T oveihang 
at each end (TOPO TA Qoning® Kit, hivitrogen Corp,, Carlsbad, CA). In comparison, a 
polymerase such as pfii, which has an inherent 3' to 5' exonuclease proof-reading activity, 
produces PGR products that are blunt-ended. Topoisomerase-modified vectors containing 
blunt ends are available for cloning of PGR products produced with pToolGreading 

15 polymerases (Zero Bhmt TOPO® PGR Cloning Kit, Invitiogen Corp., Carlsbad, GA). 
Incubation of eith^ PGR product and the proper topoisomeras&-modified vector results in 
five minute ligation. However, the orientation of the insert obtained using such cloning 
methods is random. 

Because the orientation of DNA Augment ios^tion into topoisomerase-modified 
20 cloning vectors is random, users must screen clones to identify those having the proper 
orientation. Insert oriCTtation can be determined using various methods including, for 
example, restriction en2yme analysis, in vitro transcription fix>m vector-encoded promoter 
elements, and PGR using, for exany)le, one insert-specific primer and one vector-specific 
primer. As is evident, however, the requiremmt for detenninmg insert oriratation 
25 requires an investment of time and can substantially increase the cost for id^tifying a 
nucleic acid molecule of interest, particularly where a high throughput cloning method is 
used. As such, current cloning methods are severely limited, particularly for hig^ 
througi^ut expression analysis for several reasons, because numerous laborious 
steps must be performed in order to select clones with correctly oriented inserts, and fliere 
30 is a need to screen as many as eigjit colonies of each clone to identify one having the 
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proper orientation. Thus, a need exists for mefliods and reagents fliat are useful for 
covalently linking two or more nucleic acid molecules in a directional orientation. The 
present invention satisfies this need and provides additional advantages. 

SUMMARY OF THE INVENTION 

The present mvention provides con^ositions and methods for covalently linking 
two or more (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, etc.) double stranded ("ds") nucleic add 
molecules, including directionally or non-directionally Imking two or more ds nucleic 
acid molecules. Nucleic acid molecules used in agcordance with the mvention preferably 
comprise a first end and a second end The first and/or second end of such molecules 
preferably has a S* and/or 3' extension or overhang. Thus, one or both ends of the nucleic 
acid molecules used m the invention can have a 3' and/or 5' ovechang. The overiiang 
sequences can be the same or different sequences, and can be the same or different types 
(e.g., 3' or 5* ov^hang) at botii ends of the molecule. In addition, while one md of die 
nucleic acid molecule can have a 3* extension and 5' extension, tiie oth^ end of the 
molecule can, but need not, have an extension. In some aspects, one end of nucleic acid 
molecule can contaux a 3* overiiang or 5' overiiang while the other end can be blunt ended 
(Le., it has no overiiang). In accordance with the mvention, die 3' and/or 5* extension 
sequences (i.e., overhangs) at any terminus can be any length (Le., any number of 
nucleotides), and can have any sequence. Thus, the invmtion relates to nucleic acid 
molecules having single or multiple nucleotide overiiangs. In some aspects, the nucleic 
add molecules and their termini can include modified or labeled nucleotides. In flie use 
of the invention, enzymes or piotems capable of fiising or joining or ligatingnucldc acid 
molecules can be used Thus, two or more nucldc acid molecules, which can be the same 
or different, can be joined directionally usmg such ^izymes. Such enzymes or proteins 
include, but are not limited to, topoisomerases (including types lA, IB, n, etc.), 
recombinase protems (including FLIP recombinase» hit mtegrase, ere recombinase^ etc.), 
and ligases (includmg T4 DNA ligase, etc.). 

In the methods of flie mvention, tiie 3' or 5' overiiang of one temodnus of the first 
nucleic acid molecule can have homology (or is complementary) to at least one sequence 
at or near the teiminus of at least a second nucldc add molecule. Thus, tiuoug^i base 
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pairing or hybridization of the 3' or 5' overhang or extension with the homologous or 
complementary sequence on the second molecule, the invention allows directional or non- 
directional association or joining of two diflferent molecules. In a preferred aspect, the 
3* or 5' overiiang of one terminus of at least a first molecule can ragage in strand invasion 
5 as it associates or hybridizes with its complementary sequence at or near the tmninus of 
the second molecule. In one aspect^ such a strand invasion event allows the 3' or 
S' overhang to directionally associate with a desired end of its partn^ molecule. By 
designing the overhangs and the tcaxnm of the molecules to be jomed, two or multiple 
partner molecules can be joined in the presrace of one or more proteins or CTzymes 

10 having ligase activity (e.g., topoisomerases, ligases, recombinases, in accordance 
with the invention. Thus, the invention provides methods for connecting two or more 
nucleic acid molecules (e.g., double stranded nucleic acid molecules) which involve 
covalCTtly linking at least one strand of one molecule to at least one strand of another 
molecule. The invention further provides compositions for preparing nucleic acid 

IS molecules connected by methods of tiie invmtion and compositions produced by methods 
of the invention. 

Processes of the iavention are exemplified by methods described herem which 
involve tiie covalmt linkage of strands of different nucleic acid molecules catalyzed by 
topoisomerase. Thus, the present invration relates, in part, to an isolated ds nucldc add 

20 molecule having a first end and a second end, wherein the first end contains a first 

5* ovGtfaang and a first topoisomerase coval^tiy bound at the 3' tCTninus, and the second 
end contains a second topoisomerase covalently bound at the 3* terminus and contains a 
second 5* ovetfaang, ablunt ead, or a 3' thymidine overiiang, wfaerran tfie first 5' ovediang 
is difi^erent 6am the second 5* oveifaang. The first topoisomerase and second 

25 topoisomerase can be the same or different The first S'overimg can have any nucleotide 
sequence, including, for exanq^le, the nucleotide sequence S-GGTG-3*. 

In one embodiment, the ds nucleic acid molecule is a vector, which can be a linear 
vector such as a lambda vector or a linearized vector such as a linearized plasmid The 
vector can be a clonmg vector or an expressicm vector, and can contain, for exaoqyle, one 
30 ormore (e.g., 1, 2, 3, 4, 5, 6, etc.) recombmase recognition sites such as one or more lox 
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sites or one or more att sites, one or more transcriptional regulatory elements, one or more 
tianslational regulatory elements, one or more nucleotide sequences encoding a peptide of 
interest such as one or more selectable maikers or one or more tags, or combinations 
thereof. For example, the vector can be a pUoiA^S-His version A (SEQ ID NO: 16) vector 
5 or apCR®2.l (SEQ ID N0:17) vector. 

The present invention also relates to methods of directionally or non-directionally 
linking two, three, four or more nucleic acid molecules, including, as desired, operatively 
linking two or more of the nucleic acid molecules. A method for generating a 
directionally linked recombinant nucleic acid molecule can be p^ormed, for example, by 
contacting a first topoisomerase-charged first ds nucleic acid molecule, which has a first 
topoisomerase covalently bound at a first end, and a second topoisomerase covalently 
bound at a second end, and also contains a S' oveiiiang at the first end and abhmt end, a 
3' uridine overhang, a 3' thymidine overhang, or a second 5' overhang at the second end; 
and at least a second ds nucleic acid molecule, which has a first blunt end and a second 
end, wherein the first blunt ^d has 5* nucleotide sequence that is complementary to the 
first 5' overhang of the first end of the first nucleic acid molecule. Tlie first and second 
topoisomerases can be flie same, for example, two type IB topoisomerases such as two 
Vaccinia type IB topoisomerases, or can be different, including two type IB 
topoisomerases fix>m different organisms or a type IB topoisomarase and a type lA or a 
^e n topoisomerase. 

In performing a method of the invention, the first and second (or other) ds nucleic 
acid molecules are contacted under conditions such that the S' nucleotide sequence of the 
first blunt end of the second nucleic acid molecule can selectively hybridize to the first 
5' overhang, whereby the first topoisomerase can covalently Unk the 3' terminus of the 
25 first ead of the first ds nucleic acid molecule to the 5* terminus of the first blunt end of the 
second ds nucleic acid molecule, and the second topoisomerase can covalently link the 
3' tenninus of the second end of flie first ds nucleic add molecule to the S* temiinus of the 
second end of fbo second ds nucleic acid molecule, to gmerate a dhectionally linked 
recombinant nucleic add molecule. Accordingly, tfie present invention provides a 
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directionally or non-directionally linked recombinant nucleic acid molecule produced by 
such a method 

In one aspect of performing a method of the invention, the second end of the first 
topoisomerase-charged ds nucleic acid molecule has a blunt end, and the second end of 
5 the second ds nucleic acid molecule has a blunt end. hi another aspect, the second end of 
the tGpoisomerase-chaiged first ds nucleic acid molecule has a 3' thymidine overhang, 
and the second end of the second ds nucleic acid molecule has a 3* adenosine overhang, or 
the second end of the topoisomerase-charged first ds nucleic acid molecule has a 
3* uridine (or modified form thweo^ for example, deoxyuridine) overhang, and the 

10 second end of the second ds nucleic acid molecule has a 3' adenosine oveiiiang. In yet 
another aspect, the topoisomerase-charged first ds nucleic acid molecule has a second 
5' overhang at the second end, and the second end of the second ds nucleic acid has a 
nucleotide sequence con^lementary to flie second 5' overhang. The topoisomerase- 
charged first ds nucleic acid molecule can, but need not be, a vector, including a cloning 

IS vector or an expression vector. 

A method of the invention can finlfaer include introducing a directionally or 
non-directionally-Iinked recombinant nucleic add molecule into a cell, which can be a 
prokaryotic cell such as a bacterium or a eukaryotic cell such as a mammalian cell. 
Accordingly, the present invention also provides a cell produced by a method of the 
20 invention, as well as a non-human transgenic organism produced fix>m such a cell. 

The topoisomerase-charged first ds nucleic acid molecule can be a vector, and the 
second ds nucleic acid molecule used in a method of tiie invention can be an 
anq>lification product In addition, the second ds nucleic add molecule can be one of a 
pluraKty of second ds nucleotide molecules, for example, individual members of a cDNA 
25 library or a combmatoriallibraiy« 

A method for generatmg a dhectionally or non-directionally linked recombinant 
nucleic acid molecule also can be performed, for exanq>le, by contacting a first precursor 
ds nucldc acid molecule havmg a first wd, has a first 5 ' target sequence at the 
S* tenninus and a topoisomerase recognition site at the 3* termmus, and a second end. 
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which has a topoisomerase recognition site at the 3' terminas; a second ds nucleic acid 
molecule havmg a first blunt end and a second end, wh^in the first blunt end has a 
S' nucleotide sequence complementary to the 5* target sequence of the first precursor ds 
nucleic acid molecule; and a topoisomerase that is specific for the topoisomerase 
S recognition site. The first ds nucleic acid molecule, second ds nucleic acid molecule and 
topoisomerase are contacted under conditions that allow topoisomerase activity, Lc, such 
that the topoisomerase can bind to and cleave the recognition site, to produce a 
topoisomeras&^harged 3' terminus, and can ligate tiie 3* terminus to an appropnato 
S' terminus. Such conditions also allow hybridization of tiie portion of tiie first 5' target 
1 0 sequence that remains following cleavage by the topoisomerase and the 5* nucleotide 
sequence of tiie first blunt end of the second ds nucldc acid molecule, wherein the 
S* nucleotide sequence of ttie first blunt end is compl^entary to that portion of tiie 
5' tar^t sequmce. 

In one aspect of performing a metiiod of tiie invention, the second end of the first 
IS precursor ds nucleic acid molecule is a blunt end upon cleavage by tiie topoisomerase, 
and tiie second end of the second ds nucleic acid molecule is a blunt end. In anotiier 
aspect, the second end of the first precursor ds nucleic acid molecule has a 3' tiiymidine 
extension upon cleavage by the topoisomerase, and tiie second end of tiie second ds 
nucleic acid molecule conq>rises a 3' adenosine or 3 -uridine, for example, deoxyuridme 
20 overhang, in yet anotiier aspect, tiie first precursor ds nucleic acid molecule has a second 
5 ' target sequence at tiie second end, and the second end of the second ds nucleic acid 
molecule has a 5* nucleotide sequence complementary to at least a portion of the second 
S' target sequence. 

The first precursor ds nucleic acid molecule can be a vector, including a cloning 
25 vector and an expression vector, and, where the vector gmerally is available in a cncolar 
form, can be linearized due to the action of the topoisomerase, or can be linearized by 
including, for example, one or two restriction endonucleases that linearize the vector such 
that, upon contact with the topoisommse, the first and second ds nucleic add molecules 
can be directionally or non-directionally linked according to a method of the invention. 
30 The preset invmtion also provides a directionally or non-directionally linked 
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recombinant nucleic acid molecule produced according to a method of the invention, 
which can further include, for exanq)le, a step of introducing the directionally-Iinked 
recombmant nucleic acid molecule into a cell. Accordingly, the present invention also 
provides a cell containing such a directionally or non-directionally linked recombinant 
S nucldc acid molecule, as well as a transgenic non-human organism generated fiom such a 
cell. 

The first precursor ds nucleic acid molecule can include one or more (e.g., 1, 2, 3, 
4, 5, 6, 7, etc.) e^qnression control elements, which can be operatively linked to each other, 
and the second ds nucleic acid molecule can encode all or a portion of an open reading 
10 frame, wherein the expression control element is operatively linked to the open reading 
frame in a directionally linked recombinant nucleic acid molecule generated according to 
a method of the inventiorL In addition, the second ds nucleic acid molecule can be one of 
a plurality of second ds nucleic acid molecules, for example, individual members of a 
cDNA library. 

IS A method for generatmg a directionally linked recombmant nucleic acid molecule 

also can be performed by contacting a topoisomerase-charged first ds nucleic acid 
molecule, vdiich has, at a first end, a first 5' overhang and a first topoisomerase covalently 
bound to the 3' terminus, and a second ds nucleic acid molecule, which has a first blunt 
end and a second end, wherein the first blunt end includes a 5' nucleotide sequence 

20 complementary to the Gxst 5' overhang. The method is performed under conditions such 
that the 5' nucleotide sequence of the first blunt end can selectively hybridize to the first 
5' overhang, whereby the first topoisomerase can covalently link the 3' terminus of flie 
first md of the first ds nucleic acid molecule with the 5' terminus of the first end of ttie 
second ds nucleic acid molecule. 

25 Such a mefliod can fiuther include contacting the topoisomerase-charged first ds 

nucleic acid molecule and the second ds nucleic acid molecule with a third ds nucleic add 
molecule, wh^:ein a first end of the third nucldc ds add molecule has a 5* overhang and a 
second topoisomerase covalently bound at the 3' terminus, and wherein the second ds 
nucleic acid molecule has a second blunt end, which includes a S* nucleotide sequence 

30 complementary to the seconds* overhang. The contacting can be performed, for 
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example, under conditions such that the 5' nucleotide sequence of the second blunt end of 
the second ds nucleic acid can selectively hybridize to the 5* overhang of the jfibcst end of 
the third ds nucleic acid molecule, whereby the second topoisomerase can covalently link 
the 3* terminus of the first end of the third ds nucleic acid molecule with the 5* tenninus of 
5 the second blunt end of the second ds nucleic acid molecule. Sknilarly, the method can 
be used to directionally or non-dhectionally link a fourth, fifth, sixth, or more ds nucleic 
add molecules, wherein the mis of such ds nucleic add molecules are selected as 
exemplified herein. The first and second (or oth^) topoisomerases can be the same or 
differmt and, if desired, the first or third ds nucleic acid molecules, instead of bemg 
10 topoisomerase-charged, can contain a topoisomerase recognition site, whodn the method 
can GxrOxet include contacting the reactants with a topoisomerase. 

A method of the invention can be performed simultaneously or sequentially. A 
method of the mvoation can be perfomied sequentially, for example, such that the first 
ds nucleic acid molecule is directionally linked to flie second ds nucleic acid molecule 
15 and, at a later time or in a diff^^t reaction vessel, the third ds nucleic acid molecule is 
directionally linked to flie second ds nucleic acid molecule. Alternatively, the method can 
be performed simultaneous, \^erein all of the reactants are included together at the same 
time. 

Methods of the invention are particularly usefiil for operatively linking two or 
20 more (e.g., 2, 3, 4, 5, 6, 7, 8, eto.) ds nucldc add molecules, including, for example, 
operatively linking an expression control elemmt to an open reading ftame, or 
operatively linking a first and second open reading fimie to generate a recombinant 
nucldc acid molecule encoding a fusion protein, which can be further operatively linked 
to one or more expression control element For example, in practicing a method of the 
25 invention, a first ds nucleic acid molecule can include an expression control element, a 
second ds nucleic add molecule can encode an open reading frame, and a third ds nucldc 
acid molecule can encode a peptide, wherein, in the directionally linked recombinant 
nucleic acid molecule, the expxesAon control elonent is operatively finked to the open 
reading fi:ame, and the second ds nucldc acid molecule is operatively Uhked to the third 
30 ds nucldc add molecule, and \^erem the opmttively luiked second and fbkd ds nucldc 
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acid molecules encode a fusion protein comprising the open reading ftame and the 
peptide. The peptide can be any peptide or polypeptide, including a gene product or other 
open reading frame, a tag (e,g., an affinity tag), a detectable label, and/or the like. 

The present invrntion also relates to a conqx)sition, which includes a first 
ds nucleic acid molecule having a first end and a second md, wherdn the first otid has a 
5' ovethang and a topoisomerase covalently bound at the 3' terminus; and a second ds 
nucleic add molecule having a first blunt end and a second end, wherein the first blunt 
end has a first 5* nucleotide sequence, which is complanentary to the first 5*-overiiang, 
and a first 3* nucleotide sequence complementary to the first 5' nucleotide sequence. In 
such a composition, the first 5* nucleotide sequence of the first blunt end of the second ds 
nucleic acid molecule can be hybridized to the first 5* overhang of the first end of the first 
nucleic acid molecule, wherein the Gist y nucleotide sequence of the first blunt end of the 
second ds nucleic acid molecule is displaced. The first ds nucleic acid molecule in such a 
composition can further have a second 5* overhang at the second end, and the second end 
of the second ds nucleic acid molecule can further include a second 5* nucleotide 
sequence, which is complementary to the second 5' overiiang, and a second 3' nucleotide 
sequence complementary to the second 5* nucleotide sequence. 

The present invention also relates to Mts, whidh contain one or maie reagents 
usefiil fi>r dhectionally linkmg ds nucleic add molecules. In one embodunent, a kit of the 
invention contains a ds nucldc acid molecule having a first end and a second end, 
^i^orein fce first end contains a first 5' ovediang and a first topoisomerase covalentiy 
bound at the 3' terminus, and the second end contains a second topoisomerase covalentiy 
bound at the 3' terminus and contains a second 5' overhang, a blunt end, or a 3' thymidine 
overhang, wherein the first 5' ovethang is diflferent fix)m the second 5' overhang. The 
topoisomerases can be the same or diflferOTt, and the ds nucleic acid molecule can be a 
vector, and can contain an expression control elooient 

In another embodiment, a kit of the invention contains a first ds nucleic add 
molecule, vMch has a first topoisomerase co valmtiy bound at a 3 * tenninus of a first end, 
and a second topoisomerase covalentiy bound at a 3' terminus of a second end, wherdn 
the Srst end also has a first 5* ovediang and the second end also has a blunt end, a 
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3' thymidme overhang, or a second 5' overhang, wherein, wh^ present, the second 
5' overhang is different from the first 5' overhang; and a plurality of second ds nucleic 
acid molecules, wherein each ds nucleic acid molecule in the plurality has a first blunt 
end, and wherein the 6rsl blunt end includes a 5' nucleotide sequence complementary to 
5 the first 5' oveiiiang of the first ds nucleic acid molecule. The second ds nucleic acid 
molecules in the plurality can be a plurality transcriptional regulatory elements, 
translational regulatory elements, or a combination thereof or can racode a plurality of 
peptides such as peptide tags, cell compartmentalizadon domains, and the like. 

A kit of the invention can contain one or more (e.g,, 1, 2, 3, 4, 5, 6, 7, 8, etc.) 

10 topoisomerase-charged ds nucleic acid molecules of the invention, for example, one or 
more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, etc.) topoisomerase-charged vectors; one or more (e.g, 1, 
2, 3, 4, 5, 6, 7, 8, etc.) precursor ds nucleic acid molecules, which can be contacted with a 
topoisomerase to produce a topoisomerase-charged ds nucleic acid molecule of the 
invention; or a combination thereof. The kit also can contain one or more primers or 

15 primer pairs, for example, for preparing one or a plurality of second ds nucleic add 

molecules using an amplification reaction; one or more control ds nucleic acid molecules 
to test or standardize the components of the kit; one or more ceUs, which can be, for 
example, competent cells into which a recombinant nucleic acid molecule generated 
according to a method of the invention can be mtroduced; one or more (e.g., 1, 2, 3, 4, 5, 

20 6, 7» 89 etc.) reaction buffers for performing a method of the invention; instructions for 
carrying out the method; and the like. 

In one embodiment, a method for gen^ating a directionally or non-directionally 
linked recombinant nucleic acid molecule is performed usiog a first ds nxicleic acid 
molecule with one single stranded overhang, and one topoisomerase site or one 

25 topoisomerase bound thereto. In another embodiment, a third nucleic acid molecule is 
included In accordance with this aspect of the invention, unique overhang sequences for 
the different ds nucleic acid molecules to be linked can be prepared having unique 
oveciiangs such that the nucleic acid molecules can be linked directionally and in any 
desired order. Similarly, the method can be used to link any number of nucleic acid 

30 molecules, hxcluding directionally linking two or more of the number of nucldc add 
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molecules. In certain embodimrats involving a topoisomerase-charged ds nucleic acid 
molecule containing an expression control element, a third (or other) ds nucleotide 
sequ^ce also can comprise one, two or more expression control elements or other 
sequence of interest. 

5 Hie present invention provides a method for the duectional msertion of DNA 

fragments into cloning or expression vectors with the ease and efficiency of 
topoisomerase-mediated cloning. This method has advantages over current cloning 
systems because it decreases the laborious scre^iing process necessary to identify cloned 
inserts in the desired orientation. In one aspect, the method utilizes a linearized 
10 e)q)ression vector having a single topoisomerase molecule covalently attached at both 

3* ends. Afirst end of the linearized vector also can contain a 5' single stranded ovrahang, 
and the second end can be either blunt, possess a single 3' thymidine extension for T/A 
cloning, or can itself contain a second S* single stranded oveihang sequmce. The single 
stranded overhang sequences can be any convenient or desired sequence. 

IS Construction of a topoisomerase-charged cloning vector can be accomplished by 

CTdonucIease digestion of ttie vector, followed by complementary annealing of synthetic 
oligonucleotides and site-specific cleavage of tilie heteroduplex by Vaccinia 
topoisomCTase L Digestion of a vector with any conq)atible endonuclease creates specific 
sticky ends. Custom oligonucleotides are annealed to these sticky ends, and possess 

20 sequences that, following topoisomerase I modification, form custom ends of the vector. 
The sequence and length of the single stranded ove±ang will vary based on the desires of 
the user. 

Jn a prefented use of the single strand sequence topoisomerase-charged ds nucleic 
add vectors provided by the present invention, fiie DNA fi:agme&t to be inserted into the 

25 vector is an amplification reaction product such as a PCR product Following PCR 
anq)lification wifii custom primers, the product can be duectionally inserted into a 
topoisoma:as6 1 charged cloning vector having a single strand sequence on one or both 
ends of the insertion site. The custom primers can be designed such tiiat at least one 
primer of a given primer pair contains an additional sequence at its S' end. The added 

30 sequence is designed to be complementary to the sequence of the single stranded 
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oveihang in the vector. The complementarity between the 5' single stranded oveihang in 
the vector and the 5' end of the PGR product mediates the directional msertion of the PGR 
product into the topoisomerase-mediated vector. Specifically, since only one end of the 
vector and one end of the PGR product possess complementary single stranded sequence 
5 regions, the insertion of the product in this mstance is directional, and topoisomerase can 
catalyze Kgation of the PGR product to ttie vector. 

BRIEF DESCRIPTIOINr OF THE tTGURES 

Figures 1 A to IE dq>ict a numbo* of 6s nucleic add molerailes tiiat can be used to 
practice various aspects of the mvention. The circled and boxed areas shown in these 
10 dqnctians indicate r^ons which contam sufBcirat nucleotide sequoice complementarity 
to fflgage in strand invasion wifli each other. 

Figure 1 A shows two ds nucleic acid molecules Oabeled "first^ and "second" 
molecules) which each contain one tmnmus float is cf^able of en^gmg m strand 
invasion with a termmus of flie second molecule (see boxes). When a topoisomerase is 

15 used to covalenHy Imk (e.g., Ugate) strands of each molecule, the 3' recessed strand of the 
terminus of flie first molecule will generally be diarged with topoisomerase. F^er, tiiis 
topoisomerase will generally catalyze the covaloit linkage of the 3' recessed strand of the 
temunus of the first molecule to tiie S* strand of the second molecule wifli which it 
engages in strand invasion (La, the 5' tennmus of the second nncldc add molecule nrfiich 

20 is shown in fbs box). 

Figure IB shows two ds nucleic acid molecules Qabeled "first" and "second" 
molecules), each of which contains two termini fliat are cspsble of engaging in strand 
invasion with termini of the otha- molecule. Further, each of these two nucldc add 
molecules has a blunt temiiinis and a terminus with a 5' angle stranded oveAang (see 
25 circles and boxes). The nucldc acid molecules in fliisdqnction can tims engage in two 
sq>arate strand invasion evaits whidi, i^pon covalent Imkage of nucldc add stiands at 
eadi tennini, result in flie fi>rmation of a single, drcular nuddc add molecule. Covalent 
linkage of ttie tecmini can be penformed as desoibed, Ifor exanqple, for Figure lA, above. 
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Figure IC shows two ds nucleic acid molecules (labeled "first" and "second" 
molecules), each of which contains two tennini that are cs^able of engaging in strand 
invasion with diflferent tennini of the other molecule. Further, one of these molecules has 
tw^o blunt tennini and the other molecule has S* single stranded ovediangs on each 
S terminus. The molecules in this depiction can thus engage in two separate strand invasion 
events which, upon covalent linkage of nucleic acid strand at each tennini, result in the 
formation of a circular nucleic acid molecule. Covalent linkage of the tenmni can be 
performed as described for Figure 1 A, above. 

Figure ID shows three ds nucleic acid molecules (labeled "first", "second" and 
10 "third" molecules). Two of these molecules ("first" and "third" molecules) contain 
5* single stranded overhangs which are capable of engaging in strand invasion with 
different blunt termini of the other molecule ("second" molecule). The molecules in this 
depiction can thus engage in two separate strand invasion events, which result in the 
generation of a linear nucleic acid molecule composed of all three molecules. Covalent 
IS linkage of the termini can be performed as described for Figure 1 A, above. 

Figure IE shows nucleic acid molecules similar to those set out in Figure ID, 
above, except that one of the nucleic acid molecules ("second" molecule) has 
5* ov^hangs at both tennini and the other two nucleic add molecules ("first" and 
"second" molecules) each have two blunt t^mini. 

20 Figure 2 illustrates an aspect of the invration involving strand invasion of a first 

ds nucleic acid molecule with a substantially bhmt nd containing a topoisomerase at a 
3' toxninus of a first strand containing a 5* tail upstream of a tppoisomerase recognition 
site; and a second ds nucleic acid molecule having a 3' ovediang complementary to the 
5* tail (see Chmg and Shuman, Mol Cell, BioL 20:8059-8068, 2000), The boxed areas 

25 shown in these dqnctions indicate regions which contain sufficient nucleotide sequence 
conq)lmientaiity to engage in strand uwasion with each other. 

Figure 3 provides the nucleotide sequence and ttie location of restriction 
endonuclease recognition sequmces of the Multiple Cloning Site of pUniA^5-His 
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version A (SEQ ID NO: 18), and a plasmid map of this 23 kb vector. EcoRI cloning site 
is located at nucleotide 471, and SacI cloning site is located at nucleotide 528. 

Figure 4 provides liie nucleotide sequence and the location of restriction 
eadonuclease recognition sequences of the Multiple Cloning Site of pCR 2.1, and a 
5 plasmid m^ of this vector. Hindm cloning site is located at nucleotide 234, Spel is at 
nucleotide 258 and EcoRI is at nucleotide 283 and nucleotide 299. The vector is 
3906 nucleotides. LacZ alpha fragment: bases 1-587; M13 reverse priming site: 
bases 205-221; Multiple cloning site: bases 234-355; T7 promoter/criming site: bases 
362-381; M13 forward (-20) priming site: bases 389-404; M13 Forward (-40) priming 
10 site: bases 408-424; fl origin: bases 546-960; kanamycin resistance ORE: bases 1294- 
2088; ampicillin resistance ORE: bases 2106-2966; ColEl origin: bases 3111-3784. The 
illustrated vector represents the pCR®2.1 vector with a PCR product inserted by 
TA Cloning®. Note that the inserted PCR product is flanked on each side by BcoRI sites. 
The arrow indicates die start of transcription for the T7 RNA polymerase. 

15 Figure 5 provides the nucleotide sequence of the Vector pUniA^5-His version A 

sequence (SBQ ID NO:16). 

Figure 6 illustrates digestion of pUniA^5-His version A with EcoRI and SacI, and 
the resulting cohesive end sequCTces. The resulting cohesive end on flie left side of the 
figure near the loxPel^ent is the resdting cohesive end post EcoRI digestion. The 
20 resultmg cohesive ead on the rig^t side of the figure near the V5 elment is the resulting 
cohesive mi post SacI digestion. Vector elements includmg a loxP, V5, and 6XHis 
eUmwt as well as a stop codon in fi:ame with these dements are indicated. 

Figure 7 illustrates the addition of adqjter oligonucleotides to the digested vector 
in tiie presence of DNAligase. The reaction yields the exhibited Imeaiized^ adapted 
25 vector. Ads^t^ sequences are und^lined for demarcation. The four ad^tor 
oligonucleotides have the following sequences: 

TOPODl: 5'.AATTGATCCCTTCACCGACATAGTACAG-3' (SEQ ID N0:5) 

TOPO D2: 3'-CTAGGGAAGTGG-5' (SEQ ID N0:6) 

TOPO D3: 3'-GACATGATACAGTTCCCGC-y (SBQ ID N0:8) 
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TOPO D4: 5'-AAGGGCGAGCT.3' (SEQ ID N0:7) 

T4 ligation reaction will yield the indicated linearized cloning vector, adaptor sequences 
are underlined for demarcation. 

Figure 8 illustrates a topoisomerase cleavage reaction wherein following 
5 topoisomerase cleavage of the scissile strand, a phosphate bond m the non-scissile strand 
keeps the leaving group associated to the vector. In the reaction shown, topoisomerase is 
added to the depicted ds nucleic acid molecule. Topoisomerase binds CCCTT and breaks 
the adjacent phosphodiester bond Phosphodiester bonds between the adapted vector and 
the annealing oligo in the non-scissile strand prevent the dissociation of the leaving group 
10 iq)on cleavage, hi the double stranded DNA model illustrated, X and x represent 
complementary nucleotide bases. 

Figure 9 illustrates a topoisomerase cleavage reaction wherein following 
topoisomerase cleavage of the scissile strand, flie lack of a phosphate bond m the 
non-scissile strand allows the leaving group to dissociate fiom the vector, hi the reaction 
IS shown, topoisomerase is added to the depicted ds nucleic acid molecule. Topoisomerase 
bmds CCCTT and breaks the adjacent phosphodiester bond. Lack of a phosphodiester 
bond between the adapted vector and the annealmg oligo in the non-scissile strand allows 
the dissociation of flie leaving group upon cleavage. lii the double stranded DNA model 
illustrated, X and x represent cotq>lementaiy nucleotide bases. 

20 Figure 10 illustrates that addition of an annealmg oligonucleotide to the hnearized, 

adapted vector m the absence of DNA ligase yields the exhibited Imearized, adsqpted and 
annealed vector. Note that the annealing oligonucleotide is not bound to the vector by a 
phosphate bond, tiius, allowmg dissociation following topoisomerase mediated cleavage. 
Adapter oli^nucleotides are demarcated by a smgle underline, vAnie annealing 

25 oligonucleotides axe demarcated by a double underline. There are no phophodiest^ 
Imkages between eitiier of the TOPO D3s and then: adjacmt oligonucleotides TOPO D2 
and TOPO DS The annealmg oligonucleotide has the following sequence and is 
complementary to both TOPO Dl's and TOPO D4's single stranded ov^hang: TOPO D3 
3'<TGTATCATGTCAAC-5' (SEQ ID NO:10). 
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Figure 1 1 shows an example of a linearized topoisomerase-charged ds nucleic 
acid cloning vector of the invention. The single stranded overhang corresponds to a 
Kozak transcription sequence. The vector illustrated is a Imearized TOPO flap cloning 
vector, modified pUni/His version A. 

5 Figure 12 is flie nucleotide sequence of vector pCR 2.1 sequence (SEQ ID 

N0:17). 

Figures 13A and 13B show forms of the pCR2.1® vector 

Figure 13 A shows pCR2.1® following restriction digestion with EcoRI and 
Hindm (note the resulting sticky ends). Four ad^ter oligonucleotides were ligated to the 
10 linearized vector. TOPO binding sites on the oKgonucleotides have the sequence CCC^ 
(underlined). Sticky end complementary bases are deleted in bold. The four ad£^ter 
oligonucleotides had tiie following sequences: 

TOPO H: 5'-AGCrCXiCCCTTATTCCGATAGTG-3' (SEQ ID N0:11); 
TOPO 16: 3*-GCGGGAATAAG (SEQ ID NO: 12); 
15 TOPO 1: 5'-AATTCGCCCTTATTCCGATAGTG.3'(SEQIDNO: 13); and 
TOPO 2: 3'-GCGGGAA-5' 

TOPO H and TOPO 1 have 5' ends that conq)lement the Hmdm and EcoRI sticky ends, 
req>ectively. 

Figure 13B shows tiie ad^ted version of pCR2.1® followmg mcubation with the 
20 adapter oligos in the presence of T4 ligase. 

Figure 14 illustrates the addition of annealing oligonucleotides to the ads^ted 
pCR2.1® vector, followed by flie binding of topoisomerase I and the topoisomerase 
mediated cleavage of the double stranded vector. The resulting vector is linear and 
charged with topoisomerase I on both ends. Also> one end of the vector has the custom 
25 4 bp single stranded sequence, winlo the oflier end is blunt In the mitial reaction 

illustrated, topoisom^ase binds and cleaves the double stranded DNA at the 5' end of the 
covalent binding site located near the ends of pCR2«l^, which contain the bound adi^ter 
and annealing oligonucleotides. This stq) is perfonned in the pres^ce of T4 
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polynucleotide kinase. The annealing oligonucleotides have the following sequences: 
TOPO 3: 3'-TAAGGCTATCACAAC-5' (SEQ ID NO: 15); and 
TOPO 17: 3'-GCTATCAC-5' 

There are no phophodiester bonds fonned between TOPO 3 and TOPO 2, or between 
TOPO 17 and TOPO 16. The annealing oligonucleotides are double underlined for 
donaication. 

Figure 15 illustrates a second sample of a linearized topoisomerase-charged 
ds nucleic acid cloning vector of the present invention. In this example the single 
stranded overhang sequence is 3 -TAAG-5'. This vector is the linearized, TOPO charged, 
FLAP vector, modified pCR2,l® 

Figure 16 illustrates PGR amplification of a gme of interest using primers 
designed for directional cloning. The resulting product possesses the necessary single 
stranded oveihang for dkectional cloning using a vector of the inventioiL The primer 
CACC dq)icted in the top illustration is homologous to the coding strand of flie gene of 
interest, and has the "FLAP" sequence added to its 5' end. Standard PGR amplification of 
the gene of mterest in the presence of the appropriate primers, including the CACC 
containing primer, gives the product depicted in flie bottom illustration. The product is a 
double stranded gene of interest amplicon with flq> sequence at its 5' end 

Figure 17 illustrates double stranded nucleic acid vectors of the present invention, 
including a TOPO FLAP cloning vector, which possesses a single stranded overhang can 
fidlitate insertion of amplified DNA towards proper orientation. Once conectly inserted, 
topoisomerase will ligate the product to tiie vector. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides conq>ositions and methods of using strand 
invasion to directionally or non-directionally link two or more double stranded 
(ds) nucleic add molecules. For exaniple, the present invention provides a ds nucleic 
acid molecule having a first end and a second end, wherein the first end contains a first 
5' overhang and a first topoisomerase covalentty bound at the 3* tmninus, and the second 
end contains a second topoisomerase covalentiy bound at the 3* terminus and contains a 
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second 5' overhang, a blunt end, a 3' uridine overhang, or a 3' thymidine overhang, 
wherein the fiffst 5' overhang is diflferent fom the second 5' over^ The first 
topoisoina:ase and second topoisomerase can be the same or differezit The first 
5' overhang can have any nucleotide sequence, including, for exanq>le, the nucleotide 
5 sequmce 5-GGTG, 

Aspects of the present invention modify topoisomeraso-mediated cloning so as to 
allow DNA firagments, including PCR-generated ORFs, to be directionally inserted into 
cloning vectors, while maintaining the advantages provided by ligation using 
topoisomCTase. The system greatly reduces the amount of work involved in screening to 
1 0 identify clones containing inserts in the desired orientation by enabling directional 
cloning efficiaicies that are routinely in excess of 90%. The present invention 
streamlines hi^ throughput gene expression operations and reduces costs associated with 
the screening process, and provides additional advantages. 

A topoisomerase-charged ds nucleic acid molecule of the invention generally has 
IS a single stranded overhang and a first topoisomerase covalently bound at or near a 

terminus of a first end. In addition, a topoisomerasercharged ds nucleic acid molecule of 
the invention can include a second topoisomerase covalently boimd at or near a terminus 
of the second end The single stranded overhang can be a 5' overiiang, and each 
topoisomerase can be bound at or near one or both 3' termini. Where a topoisomerase is 
20 bound to one, or preferably both, 3* termini, the second end of the topoisomerase-charged 
ds nucleic acid molecule of tiie present invention typically is a bhmt aid, a 3* thymidine 
overhang, or a second 5' ov^ang that is difiT^rent £rom the first S* overhang. 

As used herein, reference to a nucleic add molecule having "a first end" and "a 
second end" means that the nucleic add molecule is linear. Ihe term "single stranded 

25 overhang" or "overhang" is used herein to refer to a strand of a ds nucldc acid molecule 
that extends beyond the tominus of tiie complementary strand of the ds nucleic acid 
molecule. The term "5' overhang" or "5' overhangmg sequence" is used herein to refer to 
a strand of a ds nucldc acid molecule tiiat extends in a 5' direction beyond the 3' terminus 
of the cOEDplemmtary strand of flie ds nucleic acid molecule. The term "3* overhang" or 

30 "3' overlianging sequence" is used herein to refer to a strand of a ds nucldc add molecule 
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that extends in a 3' direction beyond the 5' tenninus of the complementary strand of the ds 
nucleic acid molecule. Conveniently, a 5' overhang can be produced as a result of site 
specific cleavage of a ds nucleic acid molecule by a type IB topoisomerase (see 
Examples 1 and 2). Similariy, a 3' overhang can be produced upon cleavage of a 
ds nucleic acid molecule by a type lA or type 11 topoisomerase. 

The 3' overhang and 5' overiiang can have any nucleotide sequmce and can be any 
length ( e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, etc. nucleotides), but will generally be at least two 
nucleotides. The overhangmg sequences can be selected such that the allow ligation of a 
predetemuned end of one ds nucleic acid molecule to a predetOTnlned end of a second 
nucleic acid molecule according to a method of the invention. Where the double stranded 
nucleic acid molecules are directionally linked, the 3' or 5' overhangs are generally not 
palindromic because ds nucleic acid molecules having palindromic overhangs can 
associate with each other, thus reducmg the yield of a directionally linked recombinant 
nucleic acid molecule comprising two or more ds nucleic acid molecules in a 
predetermined orioatation. The overhang can comprise, for example, a imcleotide 
sequence of a transcriptional or translational regulatory element such as a promoter, 
Kozak sequence, start codon, or the like, or a complement of such a nucleotide sequence." 

A 3' overhang or 5' overhang can include virtually any nucleotide or nucleotide 
analog or modified nucleotide tiiat can hybridize with a complementary nucleotide 
residue, provided that at least a portion of the nucleotide sequence of the overhang can 
hybridize with the complementary sequrace. Thus, tiie nucleosides in a overhang can 
include naturally occurring nucleotides such as purines (guanosine (G) or adenosme (A)), 
or pyrimidines (tiiymidine (T), uridine (U) or cytidine (Q). Additionally, the overhang 
can include substitutes for the nucleosides, for ©cample, a nucleoside such as inosine, or a 
modified form of a nucleoside such as methji guanosme, or a 5-halog»iated pyrimidine 
nucleoside (e.g., 5-bromodeoxy uridine or 5-mefhyl deoxycytidine). If desired, the 
ovahang can have a relatively high GC content, for exanople, the ovediang can have a 
greater lhan 50% GC content, such as 66% GC or 75% GC or 80% GC or 100% GC 
cantent In one embodiment, the oveifaang has the sequence 5<XjTG3\ 



wo 02/16594 



PCT/USOl/26294 



21 

A 5' or 3' overihiang of a first nucleic acid molecule, for example, can include one 
or two or a few nucleotide residues, for example, at the firee terminus of the ovediang, for 
which a complementary nucleotide residue is not present in the complementary sequence 
at or near the substantially blunt mi of the second (or other) ds nucldc acid molecule to 
5 which it is being linked Nevertheless, the ovediang at the end of the first nucleic acid 
molecule can selectively hybridize to the complementary sequence of flie second nucleic 
acid molecule due to the other nucleotide residues in the oveifaang. For exanq>le, where a 
5' ovediang consists of six nucleotides, the 5 -most one or two nucleotides need not be 
conq)lementary to the corresponding nucleotides in the complemmtary nucleotide 

10 sequence in the second nucleic acid molecule, but selective hybridization nevertheless can 
occur due to the complementarity of the remaining four nucleotide residues. The number 
or specific positions of non-complementary nucleotide residues that can be in an 
ovediang (or in the "complementary" sequence in the second nucleic add molecule) 
without substantially reducing or inhibiting hybridization specificity can be detearmined 

1 S using routme hybridization methods. 

The nucleotide residues of the ovediang can include locked nucleic acid ("LNA") 
analogues (PioUgo; Boulder CO). LNA monomers are bicyclic compounds that are 
structurally similar to ribonucleosides. The tenn "Lodced Nucldc Add" was coined to 
emphasize that the fiiranose ring conformation is restricted in an LNA by a methylene 

20 link^&at connects &e2*-Oporition to the 4'-C position. As used herein, all nucldc add 
molecules containing one or more LNA modifications are referred to as LNA molecules. 
LNA oligomers obey Watson-Crick base pairing rules and hybridize to complementary 
oligonucleotides. LNA can provide vastly unproved hybridization, stability, and 
increased themial stability porformance whm compared to DNA and olfaar nucldc add 

25 derivatives in a numb^ of situations ^shkm et al., Tetrahedtpn 54:3607-30, 1998; 
Koshkin al., LAm. Chem.Soc. 120:13252-53, 1998; Wahlestedt et al., ProcNatl, 
Acad. Sd.. USA 97:5633-38,2000). 

It should be recognized that referraice to a first end or a second end of a ds nucldc 
add molecule is not intended to imply any particular orientation of the nucldc acid 
30 molecule, and is not uitended to inq>ly a relative importance of the ends with respect to 



wo 02/16594 



PCT/USOl/26294 



22 

each other. Where a nxicleic acid molecule having a first end and second end is a double 
stranded nucleic acid molecule, each end contains a 5' terminus and a 3' terminus. Thus, 
reference is made herein, for example, to a nucleic acid molecule containing a 
topoisomerase recognition site at a 3' terminus and a hydroxyl group at the 5* terminus of 
S the same end, which can be the first end or the second end. 

Topoisomerase when bound to a nucleic acid molecule, will generally be bound 
"at or near" a terminus of a ds nucleic acid molecule. The tenn "at or near" when used 
with respect to a topoisomerase, means that the topoisomerase is covalently bound to one 
strand of a ds nucleic acid molecule such that it can ligate the temiinus of flie strand to 

10 which it is bound, to a second nucleic acid molecxile containing a fi-ee 5' terminal 

hydroxyl group. Generally, the topoisomerase is "at or near" an end by virtue of being 
covalently bound to one terminus of the end. For example, where the topoisomerase is a 
type IB topoisomerase such as a Vaccinia topoisomerase, the topoisomerase is bound at 
the 3' temiinus of an end of a ds nucleic acid molecule. However, an end having a 

IS topoisomerase covalently bound to a terminus of the end also can contain a single 
stranded overhang sequence in the complem^tary strand, thus extending beyond the 
terminus to which the topoisomerase is bound. Such a topoisomerase is an example of a 
topoisomerase near an end of the ds nucleic acid molecule. 

As used haein, the term "isolated," when used in reference to a molecule, means 
20 that the moleciile is in a form other than that in which it exists in nature. lhgeneral,an 
isolated nucleic acid molecule, for example, can be any nucleic acid molecule that is not 
part of a genome in a cell, or is sq)arated physically fix)m a cell tfiat normally contains the 
nucleic acid molecule. It should be recognized that various compositions of the invention 
comprise a mixture of isolated ds nucleic acid molecules. As such, it will be understood 
25 that the temi "isolated" only is used in respect to &e isolation of the molecule fiom its 
natural stat^ but does not indicate that the molecule is an only constituent 

Topoisomerases are a class of enzymes that modify the topological state of DNA 
via the breakage and rejoining of DNA strands (Shuman et al., U.S. Pat. No. 5,766,891, 
incorporated herein by reference). Topoisomerases are categorized as type I, including 
30 type lA and type IB topoisomerases, which cleave a single strand of a double stranded 
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nucleic acid molecule, and type n topoisomerases (gyrases), which cleave both strands of 
a nucleic acid molecule. As disclosed herein, type I and type II topoisomerases, as well 
as catalytic domains and mutant forms thereof, are useful for generating directionally 
linked recombinant nucleic acid molecules according to a method of the invention* 
5 Type n topoisomerases have not geaeceSly been used for generating recombinant nucleic 
add molecules or cloning procedures, \^ereas type IB topoisomerases, are used in a 
variety of procedures. 

Typo lA and IB topoisomerases cleave one strand of a ds nucleic acid molecule. 
Cleavage of a ds nucleic acid molecule by type lA topoisomerases generates a 

10 5' phosphate and a 3' hydroxy! at the cleavage site, with the type lA topoisomerase 
covalently binding to the 5' terminus of a cleaved strand In comparison, cleavage of a 
ds nucleic acid molecule by type IB topoisomerases generates a 3* phosphate and a 
5* hydroxyl at the cleavage site, with the type IB topoisomerase covalmtly binding to the 
3' terminus of a cleaved strand. Type lA topoisomerases include, for example, E, coli 

15 topoisomerase I and topoisomerase m, eukaryotic topoisomerase n, and archeal reverse 
gyrase (see Berger, BinRhi'm. fti ophvs. Acta 1400:3-18, 1998, which is incorporated 
h^in by reference). 

Type IB topoisomerases include the nuclear type I topoisomerases present in all 
eukaryotic cells and those oicoded by Vaccinia and other cellular poxviruses (see Cheng 

20 et al., £dl 92:841-850, 1998, which is incorporated herein by reference). The eukaryotic 
type IB topoisomerases are exemplified by those expressed in yeast, Drosophila and 
mammalian cells, including human cells (see Caron and Wang, Adv. PhaimacoL 
29B,:271-297, 1994; Gupta et al., Binnhim Rin phvs. Acta 1262:1-14, 1995, each of 
which is incorporated herein by reference; see, also, Berger, supra, 1998), Vtial type IB 

25 topoisomerases are exemplified by those produced by the vibrate poxvkuses {Vaccinia, 
Shope fibroma virus, ORF virus, fbw^x virus, and moUuscum contagiosmn virus), and 
the insect poxvmis (Amsactamoorei entomopoxvirus) (see Shuman, Biorfiim TlinpliYH 
Ma 1400:321-337, 1998; Petersen et al., Vgologv 230:197-206, 1997; Shuman and 
Ptescott, Proc. NatL Acad. ScL USA 84:7478-7482, 1987; Shuman. J.BioLChem. 
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269:32678-32684, 1994; U.S. Pat No. 5,766,891; PCT/US95/16099; PCT/US98/12372, 
each of which is incorporated herein by refeeace; see, also, Cheng et al., supra^ 1998), 

Type n topoisomerases include, for example, bacterial gyrase, bacterial DNA 
topoisomerase IV, eukaryottc DNA topoisomerase n, and T-even phage encoded DNA 
5 topoisomerases (Roca and Wang, Cdl 71 :833-840, 1992; Wang, J. Biol, Chem. 266:6659- 
6662, 1991, each of which is incorporated herem by reference; Beiger, supra, 1998). 
Like the type IB topoisomerases, the type n topoisomerases have both cleaving and 
ligating activities. In addition, like type IB topoisomerase, substrate ds nucleic acid 
molecules can be prepared such that the type n topoisomerase can form a covalent 

10 Unkage to one strand at a cleavage site. For example, calf thymus type II topoisomerase 
can cleave a substrate ds nucleic acid molecule containing a 5* recessed topoisomerase 
recognition site positioned three nucleotides from the 5' end, resulting in dissociation of 
the three nucleic acid molecule 5' to the cleavage site and covalent binding of the 
topoisomerase to the 5* terminus of the ds nucleic acid molecule (Andersen et al., supra, 

15 1991). Furthermore, \spon contacting such a type n topoisomCTase-charged ds nucleic 
acid molecule with a second nucleic acid molecule containing a 3' hydroxyl group, the 
^e n topoisomerase can Ugate the sequences togefhor, and then is released from the 
recombinant nucleic add molecule. As such, type n topoisomerases also are use&l for 
performing methods of the invoation. 

20 Structural analysis of topoisomerases indicates that the members of each particular 

topoisomerase femilies, including type lA, type IB and type n topoisomerases, share 
common structural features with other members of the family (Berger, supra, 1998). In 
addition, sequence analysis of various type IB topoisomerases indicates that the stmctures 
are highly conserved, particularly in the catalytic domam (Shuman, supra, 1998; Oieng et 

25 al., supra, 1998; Petersen et al., supra 1997). For example, a domain comprising amino 
acids 81 to 314 of the 314 amino acid Vaccinia topoisomerase shares substantial 
homology with other type IB topoisomerases, and ttie isolated domain has essentially the 
same activity as Oe fiiU length tqpoisomaase, althon^die isolated domain has a Acm& 
turnover rate and lower binding a£Bnity to the recognition site (see Shuman, supra^ 1998; 

30 Cheng et al., st^ra^ 1998). bi addition, a mutant Vacdnia topoisomerase, which is 
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mutated in the amino terminal domain (at amino acid residues 70 and 72) displays 
identical properties as the fiill length topoisomwase (Cheng et al., supra^ 1998). hi feet, 
mutation analysis of Vaccinia type IB topoisomerase reveals a large number of amino acid 
residues that can be mutated without aflfecting the activity of the topoisomerase, and has 
5 identified several amino acids that are required for activity (Shuman, supra^ 1998). In 
view of the high homology shared among the Vaccinia topoisomerase catalytic domain 
and the other type IB topoisomerases, and the detailed mutation analysis of Vaccinia 
topoisomerase, it will be recognized that isolated catalytic domains of the type IB 
topoisomecases and type IB topoisomerases havmg various amino acid mutations can be 
10 used in the methods of the invention and thus are considered to be topoisomerases for 
purposes of the present invention. 

The various topoisomerases exhibit a range of sequence specificity. For example, 
type n topoisomerases can bmd to a variety of sequences, but cleave at a highly specific 
recognition site (see Andersen et aL, J. Biol Caiem. 266:9203-9210, 1991, which is 

15 incorporated herein by reference.). In comparison, the type IB topoisomerases include 
site specific topoisomerases, whidi bind to and cleave a specific nucleotide sequence 
("topoisomerase recognition site"), l^on cleavage of a ds nucleic acid molecule by a 
topoisomerase, for example, a type IB topoisomoruse^ the energy of the phosphodiester 
bond is conserved via the formation of a phosphotyros^ Imkage between a specific 

20 tyrosine residue in the topoisomerase and the 3' nucleotide of the topoisomerase 

recognition site. Where the topoisomerase cleavage site is near the 3* texminus of the 
nucldc acid molecule, the downstream sequrace (3* to the cleavage site) can dissociate, 
leaving a nucldc add molecule having the topoisomerase covalently bound to the newly 
generated 3' end (see FIG 9). 

25 The covalently bound topoisomerase also can catalyze fiie reverse reaction^ for 

example, covalent linkage of the 3' nucleotide of the recognition sequence, to which a 
type IB topoisomerase is linked througji the phosphotyiosyi bond, and a nucldc acid 
molecule containing a fi:ee S' hydroxyl group. As sudi, methods have been develcqied for 
using a type IB topoisomerase to produce recombinant nucldc acid molecd As such, 

30 clonmg vectors contaimng a bound type IB topoisomerase have be» developed and are 
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commercially available (Invitrogen Corp., Carlsbad CA), Such cloning vectors, when 
linearized, contain a covalently bound type IB topoi5omra:ase at each 3' end 
("topoisomerase-charged"). Nucleic acid molecules such as those comprising a cDNA 
library, or restriction fragments, or sheared genomic DNA sequences that are to be cloned 
5 into such a vector are treated, for example, with a phosphatase to produce S' hydroxyl 
termini, then are added to the linearized vector under conditions that allow the 
topoisomerase to ligate the nucldc acid molecules at the 5' terminus containing the 
hydroxyl group and the 3' terminus containing tiie covalently bound topoisomerase. A 
nucleic acid molecule sudi as a PGR an^lification product, which is produced containing 
10 5* hydroxyl ends» can be cloned into a tqpoisomerase-charged vector in a r^id joining 
reaction (approximately 5 min at room temperature). The rapid joining and broad 
t^nperature range inherent to the topoisom^:ase joining reaction makes the use of 
topoisomerase-charged vectors ideal for hi^ throughput i^lications, which generally 
are performed using automated systems. 

15 Vaccinia virus encodes a 3 14 amino acid type I topoisomerase enzyme capable of 

site-specific single-strand nicking of double stranded DNA, as well as 5* hydroxyl driven 
religatioiL Site-specific type I topoisomerases include, but are not limited to, viral 
topoisomerases such as pox virus topoisomerase. Exanq>les of pox virus topoisom^ases 
include Shope fibroma vuus and ORF vkus. Other site-specific txqmisomerases are well 

20 known to fiiose skilled in the art and can be used to practice fliis invention. 

Vaccinia topoisom^nse binds to diq)lex DNA and cleaves the phosphodiester 
backbone of one strand ^v^iile exhibiting a hig^ level of sequence specificity. Qeavage 
occurs at a consensus pentapyrimidine element 5"-(C/T)CCTT4^, or related sequences in 
the scissile strand. In one embodim^t the scissile bond is situated in tfie range of 2 to 

25 12 bp &om tiie 3' end of the dxxplsK DNA. In another embodiment cleavable con^ilex 
formation by Vaccinia topoisomerase requires six dvplesi nucleotides upstream and two 
nucleotides downstream of the cleavage site. Examples of Vaccinia topoisomerase 
cleavable sequences include^ but are not limited to, -F6/-6 dnploK GCCCTTATTCCC 
(SBQ ID N0:1), +8M duplex TCGCOSTIATTC (SEQ ID N0:2), +10A2 duplex 

30 TGTCGCeSlAT (SEQ ID N0:3), +11/-1 dxsphsK GTGTC GCCCTTA (SEQ ID N0:4). 
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Examples of other sit&-specific type I topoisomerases are well known in the art 
These enzymes are encoded by many organisms includmg, but not Ihnited to 
Saccharomyces cerevisiae, Saccharomyces pombe and Tetrahymena, however^ flie 
topoisomerase I enzymes of these q)ecies have less specificity for a consensus sequence 
5 than does die Faccinia topoisomerase. fLvnn et al,. Proc. Natl. Acad Sci. USA 86: 3559- 
3563, 1989; Eng et al., LBioLChePL 264: 13373-13376, 1989; Busk et al.. Nature 327: 
638-640, 1987). 

The compositions and methods of the mvention are exemplified generally herein 
wifli reference to the use of type IB topoisomerase such as the Vaccinia topoisomCTase. 

10 However, it will be recognized that the methods also can be performed using other 
topoisomerases merely by adjusting the componrats accordingly. For example, as 
described in greater detail below, methods are disclosed for incorporating a type IB 
topoisomerase recognition site at one or both 3* termini of a ds nucleic acid molecule. 
Accordiagly, in view of the present disclosure, the artisan will recognize that a 

1 5 topoisomerase recognition site for a type lA or type n topoisomerase similarly can be 
incorporated into a ds nucleic acid molecule, 

A topoisomerase-charged ds nucleic acid molecule that contains a 5' overhang on 
a first end generally contains a topoisomerase covalently bound to the 3' terminus of the 
first rad. A dsnucldc add containing a 5* overhaiig and first topoisomerase at a first end, 

20 also can contain a second topoisomerase covalently bound to the second end. The 

topoisomerase coval^tly bound to the first end can be the same as or different fiiom the 
topoisomerase covalently bound to the second end. Thus, a Vaccinia topoisomerase can 
be covalently bound to a first md and anotiier poxvirus or eukaryotic nuclear type IB 
topoisomerase can be bound to the second end. Graerally, where the topoisomerases at 

25 each end are different, they are members of the same general fiimily, for example, type lA 
or type IB or type n topoisomerase. 

In one ^bodiment, a topoisomerase-charged double stranded nucleic add 
molecule of the invCTtion is a vector, which can be a cloning vector or an expression 
vector. The vector can include elements such as a bacterial origin of rq)lication, a 
30 eukaryotic origin of replication, antibiotic resistance genes, and the like, and can fiuther 
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include topoisomerase recognition sites or topoisomerase-charged ends or a combination 
thereof. Such vectors of the invention can conveniently be packaged into kits as 
disclosed herein. A vector of the invention can be a plasmid vector, a cosmid vector, an 
artificial chromosome (e.g., a bacterial artificial chromosome, a yeast artificial 
S chromosome, a mammalian artificial chromosome, etc.) or a viral vector such as a 
bacteriophage, baculovirus, retrovirus, lentivirus, adenovirus, Vciccinia virus, semliki 
forest virus and adeno-associated virus vector, all of which are well known and can be 
purchased fix>m comm^al sources (Promega, Madison Stratagene, La Jolla CA; 
GIBCO/BRL, Gaithersburg MD). Wal expression vectors can be particularly usefiil 
10 i^ere a method of the invention is practiced for the pwpose of generating a directionally 
linked recombinant nucleic acid molecule that is to be introduced into a cell, particularly 
a cell in a subject Viral vectors provide the advantage that fhey can infect host cells with 
relatively high efELciency and can infect specific cell types or can be modified to infect 
particular cells in a host 

15 Viral vectors have been developed for use in particular host systems and include, 

for example, baculovirus vectors, which infect insect cells; retroviral vectors, other 
lentivirus vectors such as those based on the human immunodeficiency virus (EHV), 
adenovirus vectors, adeno-associated virus (AAV) vectors, herpesvirus vectois, Vaccinia 
virus vectors, and the like, which infect mammalian cells (see Mill^ and Rosman, 

20 BioTechniaues 7:980-990, 1992; Anderson et aL, Nature 392:25-30 SuppL, 1998; Verma 
and Somia, Nature 389:239-242, 1997; Wilson, New End. J.Med 334:1 185-1 187, 1996, 
each of which is incorporated herehi by reference). For exaiiq>le, a viral vector based on 
an HIV can be used to infect T cells, a viral vector based on an adenovirus can be used, 
for exanq)le, to infect respuatory epithelial cells, and a viral vector based on a herpesvirus 

25 can be used to infect neuronal cells. Other vectors, such as AAV vectors can have greater 
host cell range and, therefore, can be used to infect various cell types, altliou^ viral or 
non-viral vectors also can be modified with specific receptors or ligands to alter target 
specificity throu^ receptor mediated events. 

A linearized vector of the invention, which is topoisomerase-charged or contains 
30 topoisomerase recognition sites can be aerated using methods as disclosed herein or 
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otherwise known in the art. For sample, a circular vector can be linearized, and 
modified by ligating or hybridizing one or more oligonucleotides, to generate a 
topoisomerase recognition site, or a cleavage product thereof; and a target 5' sequence or 
5" overhang, at one or both aids (see Examples 1 and 2). The vector also can contain, for 
5 example, ei^ression control elements required for replication in a prokaryotic host cell, a 
eukaryotic host cell, or both, and can contain a nucleotide sequence encoding a 
polypq)tide that confers antibiotic resistance or the Iflce, or such elements can be 
introduced info the vector using the methods of the invention. Furthermore, the vector 
can contain one, two, or more site specific integration recognition site such as an att site 
10 or lox site. The mcorporation, for example, of attB or attP sequences mto an isolated 
nucleic acid molecule of &e presrat mvention allows for the convenient manipulation of 
the nucleic acid molecule using flie GATEWAY™ Clonmg System (hwitrogra CJorp., La 
JolIaCA). 

The invention provides a modified cloning vector having an overhanging single 
15 stranded piece of DNA charged with topoisomerase. The modified vector allows the 
directional insertion of linear ds nucleic acid molecules, for example PCR amplified, or 
otherwise suitable ORFs, for subsequent expression, and takes advantage of 
topoisomerase cloning efiBcimcy. As used harem, the term donor signifies molecules 
such as a duplex DNA which contains a S '-CCCTT cleavage site near the 3 ' end, md the 
20 tmn acceptor signifies a duplex DNA which contains a S'-OHtenninus. Once covalently 
activated by topoisomerase the donor will be transferred to those acceptors to vMch it has 
single strand sequence conq>lementatioa 

According to the present invention, in particular embodiments topoisomerase- 
modified vectors are fiuth^ ads^ted to contain at least one S* single-stranded oveihang 
25 sequence to facilitate the directional insertion of DNA segments. A nucleic acid molecule 
to be cloned into such a vector can be a PCR product constituting an ORF, which can be 
expressed fix>m the resultant recombinant vector. The primers used for amplifying the 
OSF are designed such that at least one primer of the primer pair contains an additional 
sequence at its 5* end Tliis sequence is dedgned to be complementary to the sequent 
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the S' single-stranded overhang present in the topoisomerase-modified vector of the 
present invention. 

The present invention generally provides methods for generatmg a directionally or 
non-directionally linked recombinant nucleic acid molecule, by using a strand invasion 
5 event and a ligation event to link, in a directional or non-directional manner, a first 
nucleic acid molecule and at least a second nucleic acid molecule. As used herein, the 
term "strand invasion" refers to the displacement of one strand of a first double stranded 
nucleic acid molecule by a single stranded portion of a second nucleic acid molecule, 
wherein the single strand has nucleotide sequence that is substantially identical to the 
10 displaced strand and can selectively hybridize to the strand complementary to the 
displaced strand. 

A method for generating a directionally or non-directionally linked recombinant 
nucleic acid molecule can be performed, for example, by contacting a first ds nucleic acid 
molecule having a first overhang on a first strand (e.g., a 3' or S' strand) at a first end; and 

15 a second ds nucleic acid molecule having a first substantially blunt end and a second end, 
wherein a nucleotide sequence that is complementary to the first overhang of the first end 
of the first nucleic add molecule, is present at or near the first substantially blunt end. 
The method is performed under conditions such that the first overhang can selectively 
hybridize to the complementary nucleotide sequence of the first substantially blunt end of 

20 the second ds nucleic acid molecule, and the first end of tiie first ds nucleic acid molecule 
and the first end of the second ds nucleic acid molecule can be linked. The first overhang 
can be a 3' overhang or a 5' oveihang. The invention fiirther provides precursor nucleic 
acid molecxiles ^^ch can be used to prq>are molecules suitable for use in the method 
described above. The invention also provides nucleic acid molecules prepared by the 

25 above method. 

Figure 1 illustrates examples of ways in which the methods of the invention can 
be used to generate a covalentiy linked recombinant nucleic acid molecule. The boxes 
and cux^les in Figure 1 are used to d^ict regions of sequence complementarity such that a 
strand displacement event can occur. The other end of the ds nucleic acid molecules, 
30 which do not necessarily (but can) mvolve a strand displacement event, can have any 
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structure, including, can be substantially blunt or can include a 3* or 5' overhang. Other 
combinations of blunt ends and/or oveifaangs on the iSrst, second, third, etc. ds nucleic 
acid molecules can be linked according to the methods of the invention and will be 
evident to those m the art based, in part, on the examples provided in Figure 1. 

S As shown in Figure lA, a method for generating a directionally or 

non-directionally linked recombinant nucleic acid molecule can be performed, for 
example, by contacting a first ds nucleic acid molecule, which has a first overhang on a 
first strand at a first end; a second ds nucleic acid molecule, which has a first substantially 
blxmt end and a second end, wherein the first substantially blunt end has a nucleotide 

10 sequence that is complementary to the first overhang of the first end of the first nucleic 
acid molecule; and a reagent for ligating the nucldc acid molecules (e.g., a reagent 
conq>rising a topoisomerase, a ligase, or a recombinase). The method is performed under 
conditions such that the first ovaiiang can selectively hybridize to the con]5)lementary 
nucleotide sequence of the first substantially blunt end of the second ds nucleic acid 

1 5 molecule. Furthermore, the method is performed in the presence of a reagent that can 
ligate a 5* strand of one nucleic acid molecule to a 3' strand of a second nucleic acid 
molecule, and voider conditions such that the 3' terminus of the first end of the first ds 
nucleic acid molecule and the 5' terminus of the first end of the second ds nucleic acid 
molecule are linked In various modifications of the method described above, as well as 

20 in other methods desoibed above, flie first oveiiiang canbe a 3* ovediang or a 
5' oveihang. 

As shown in Figure IB, a mefliod for generating a linked, for example 
directionally linked, recombinant nucleic acid molecule can be performed by contacting a 
first ds nucleic acid molecule with a first overhang on a first strand at a first ^d and a 

25 second substantially blunt end; and a second ds nucleic acid molecule, which has a first 
substantially blunt end and a second end which has a second overhang, wherein the first 
substantially blunt end of tiie second ds nucleic acid molecule has a nucleotide sequence 
tiiat is complementary to the first ov^ang of tiie first end of the first nucleic acid 
molecule, and wherem the second substantially blunt end of the finst nucleic add 

30 molecule has a nucleotide sequence that is complementary to the second ov^faang of the 
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second end of the second ds nucleic acid molecule. The method is perfomied under 
conditions such that the first oveifaang can selectively hybridize to the complementary 
nucleotide sequence of the first substantially blunt end of the second ds nucleic acid 
molecule, and wherein the second oveiiumg can selectively hybridize to the 
5 complementary nucleotide sequence of the second substantially blunt end of the first ds 
nucleic acid molecule. Furthermore, the method may be p^onued in the presence of a 
reagent which can catalyze the ligation of a 3' strand of one nucleic acid molecule to a 
S' strand of anoth^ nucleic acid molecule, and und^ conditions such that flie 3' terminus 
of the first end of the first ds nucleic acid molecule is linked to the 5' temiinus of the first 

1 0 end of the second ds nucleic acid molecule, and the 3' tominus of the second substantially 
blunt Old of the first ds nucleic acid molecule is linked to fiie S' terminus of the second 
end ofthe second ds nucleic acid molecule. In various modifications of the method 
described above, one or both the first and second ovofaangs can be 3' ov^iiangs or 
5' overiiangs. The ds nucleic acid molecules can thus engage in two separate strand 

1 S invasion events which, upon covalent linkage of nucleic acid strands at each termini, 
result m flie formation of a circular recombinant nucleic acid molecule. 

As shown in Figure IC, a method for generating a linked, for example 
directionally linked, recombinant nucleic acid molecule can be performed, for exan^le, 
by contacting a first ds nucleic acid molecule with a first overhang on a first strand at a 

20 first end and a second end having a second overhang and a second ds nucleic add 

molecule, whidi has a first substantially blunt md and a second substantially blunt md, 
^erem the first substantially blunt aid ofthe second ds nucleic acid molecule has a 
nucleotide sequence that is conq>lanentary to the first ov^jiang ofthe first end ofthe first 
nucldc add molecule, and ^dierein the second substantially bhmt end ofthe second 

25 nucldc add molecule has a nucleotide sequence that is complementary to fiie second 
ovorhang of the second end of the fixst ds nucldc add molecule. The me&od may be 
performed under conditions such that the first ovoiiang can selectively hybridize to the 
cocq)lementary nucleotide sequence ofthe first substantially blunt end of tiie second ds 
nucldc add molecule, and wherdn the second overhang can selectively hybridize to the 

30 complementary nucleotide sequence of the second substantially blunt end ofthe second 
ds nucldc acid molecule. Furtfaennore, tiie method is performed under conditions such 
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that tilie first end of the first ds nucleic acid molecule is linked to the first end of the 
second ds nucleic acid molecule, and flie second end of the first ds nucleic add molecule 
is linked to the second substantially blunt end of the second ds nucleic acid molecule. In 
various modifications of the method described above, one or both (he first and second 
S overhangs can be 3* overhangs or 5* overhangs. The ds nucleic acid molecules can thus 
engage in two separate strand invasion events, vMch^ i4)on covalent linkage of nucleic 
acid strands at each tCTnini, result in the formation of a circular recombinant nucleic acid 
molecule. 

As shown in Figure ID, a method for generating a linked, for example 

10 directionally linked, recombinant nucleic acid molecule can be pezformed, for exan^le, 
by contacting a first ds nucleic acid molecule, which has a first overhang on a first strand 
at a first end; a second ds nucleic acid molecule, which has a first substantially blunt end 
and a second substantially bhmt end; and a third ds nucleic acid molecule which has a 
second ovohang on a first strand at a first end, wherem the first substantially bhmt end of 

IS the second ds nucleic add molecule has a nucleotide sequence that is conoqplementary to 
the first ov^hang, and the second substantially blunt end of the second ds nucldc acid 
molecule has a nucleotide sequmce that is complementary to the second overhang. The 
method is performed under conditions such ftat the first ovediang can selectively 
hybridize to the complementary nucleotide sequence of the first substantially blunt end of 

20 the second ds nucldc add molecule^ and whmin the second overhang can selectively 
hybridize to the complementary nucleotide sequence of the second substantiaUy blunt end 
of the second ds nucleic acid molecule. Furthomore, the mettiod may be performed 
under conditions such that the first end of the first ds nucldc add molecule is linked to 
flie first ead of flie second ds nucldc add molecule^ and the first end of the fliird ds 

25 nucldc add molecule is linked to the second substantially bhmt end of the second ds 
nucldc add molecule. In various modifications of flie method described above, one or 
botii the first and second overhangs can be 3' overhangs or 5* overhangs. 



As shown in Figure 1 E» a method for gqaeratmg a Imked, for exsaaaple 
directionally Unked, recombinant nucldc acid molecule can be performed, for example, 
30 by contacting a first ds nucldc acid molecule, which has a first substantially bhmt end; a 
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second ds nucleic acid molecule which has a first overhang on a first strand at a first end 
and a second overhang on a second strand at a second end; and third ds nucleic acid 
molecule, which has a second substantially blunt end, wherein the first substantially blunt 
end of tiie first ds nucleic acid molecule has a nucleotide sequence that is complementary 
5 to die first overhang of the first end of the second nucleic acid molecule, and wherem the 
second substantially blunt end has a nucleotide sequence that is complementary to the 
second overhang of the second end of the second ds nucleic acid molecule. The method 
is p^ormed under conditions such that tiie first overhang can selectively hybridize to the 
complementary nucleotide sequence of the first substantially blunt end of the first ds 

10 nucldc acid molecule, and wherein the second ov^ang can selectively hybridize to the 
complCTientary nucleotide sequence offlie second substantially blunt end. FurtfaCTnore, 
the metiiod may be p^ormed under conditions such that the first substantially blunt end 
of the first ds nucleic acid molecule is linked to tiie first end of the second ds nucleic acid 
molecule, and the second substantially blunt end^ located on the thud ds nucleic acid 

1 5 molecule, is linked to tiie second end of tiie second ds nucleic acid molecule. In various 
modifications of the method desoibed above, one or botii the first and second oveifaangs 
can be 3* ovetfaangs or S* oveifaangs. 

A method for generating a directionally or non-^ectionally linked recombinant 
nucleic acid molecule can be performed, for exaniple, by contacting a first ds nucleic acid 

20 molecule, which has a first topoisomerase covalentiy bound at or near a first substantially 
blunt end; and a second ds nucleic acid molecule, which has a first 3* ovediang on a first 
strand at a first end, wherem tiie first substantially blunt end of tiie first ds nucleic acid 
molecule has a nucleotide sequence tiiat is conqilementary to the first 3* overiiang (see 
Figure2). The method is perfonnedund^ conditions such tiiat the first 3' ovesimg 

25 selectively hybridize to tiie complementary nucleotide sequence of tiie first substantially 
blunt end of tiie first ds nucleic add molecule. Fiuihmnore, the method is performed 
sudi that topoisomerase can covalenfly link the 3 ' terminus of the first end of the first ds 
nucleic acid molecule to the 5' terminus of tiie first mi of the second ds nucleic acid 
molecule (Cheng and Shuman, Mol.Cell.BioL 20:8059-8068, 2000, whidi is 

30 incorporated herem by reforence in its entirety). 
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The ability of a topoisomerase covalentty bound at or near a first substantially 
blunt end of a first ds nucleic acid molecule to covalently linked to a second ds nucleic 
acid molecule with a 3* overhang (Figure 2) iUuminates a previously uni^ipreciated 
conformational flexibility of covalently bound topoisomerase with respect to its DNA 
5 contacts on the 5' side of llie sdssile phosphodiester. In catalyzing the relaxation of 
si^ercoiled DNA, covalmtly bound topoisomoase type IB releases the downstream 
diq)lex and permits rotation of the duplex around the phosphodiester bond opposite the 
sdssile phosphate before resealmg the backbone. 

A method of the present invention can be performed in a manner that obviates the 
10 need to perform an additional reaction to tepsdr a remaining nick aflCT the homology 
dependent ligation, by substantially replacmg one strand of a nucleic acid molecule (see 
Figure 2). For example, the methods can be p^ormed such that the ov^ang sequence 
of one nucleic acid molecule extrads the entire length of and, iqnm strand invasion, 
replaces a strand of the oth^ ds nucleic acid molecule. Thus, in diis embodiment, fliere is 
IS no nick in tiie strand that was not ligated by the topoisomerase. 

The termini of the ds nucleic acid molecxiles that are linked using the methods of 
the current invention can be covalmtly linked, using any reagent useful for linking a 
5* terminus of a one nucleic acid molecule to a 3' terminus of a second nucleic add 
molecule. Thus, the reagent for covalently linking the termini can be, for exaii:q>le, a 

20 topoisomerase, including a type lA, type IB or type n topoisomerase; a ligase such as 
T4 DNA ligase; a recombinase, including FLIP recombinase, Int mtegrase, or ere 
recombinase; or another INT femily member (see, for example, Nucl, Acids Res. 26:391- 
406, 1998). Furthermore, where one nick remains after a ligation of one strand, the nick 
can be closed by an m vivo ligation, for exanq)le by introduction into a cell, such as 

25 E. coli, of the nicked ds nucldc acid molecule. In certain prefened embodiments, the 
linldng of tihe two ends involved in a strand displacement eveat involves topoisomerase 
ligation. Furdiennore, in a mediod as disclosed for linldng a fiostdsnw^ 
molecule and a second ds nucldc acid molecule, a diird nucldc add molecule also can be 
Unked to an ead of the first or second nucleic add molecule fhat is not involved in a 

30 strand displacement event 
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Methods of the present mvration can be performed to link a first ds nucleic acid 
molecule to at least a second ds nucleic acid molecule in a non-directional or, preferably, 
a directional manner. However, the methods can be used to non-directionally Unk the first 
ds nucleic acid molecule and the second ds nucleic acid molecule m embodiments where 
5 complementarity exists hetwem nucleotide sequences at or near a tmnimis of bofli ends 
of one of the ds nucleic acid molecules and nucleotide sequences at or near a terminus of 
at least one end of the other ds nucleic add molecule. Such comploientarity between 
nucleotide sequences at or near a terminus of both ends of one ds nucleic acid molecule 
and at least one strand of the second ds nucleic acid molecule can be achieved, for 
10 example, by including identical nucleotide sequences at the same tCTninus (i.e., 5' or 3*) 
of both ends of a ds nucleic add molecule. This can be accomplished, for example, by 
designing target sequences that can be cleaved with the same restriction enzyme and 
which contain the same nucleotide sequence. 

The present invention also relates to methods of directionally or non-directionally 
15 linking a first and at least a second nucleic acid molecule, including, as desired, 

operatively linking two or more (e.g., 2, 3, 4, 5, 6, 7, etc.) of flie nucldc acid molecules. 
A method for generatiiig a directionally or non-durectionally linked recombmant nucleic 
acid molecule can be p^otmed, for exanq>le, by contacting a topoisomerase-charged 
first ds nucleic acid molecule, ^ch has a first topoisom^ase oovalmtly bound at a first 
20 end, and a second topoisomerase covaleotly bound at a second end, and also contains a 
S' oveiiiang at the first cad and a blunt end, a 3* uridine overhang, a 3* thymidine 
overhang, or a second 5* overiiang at the second mi; and a second ds nucldc add 
molecule, which has a first blunt end and a second end, wherein the first blunt eod has a 
S' nucleotide sequmce fliat is complem»tary to die first 5* ovofliang of die first end of the 
25 first nucldc acid molecule. The first and second topoisomerases can be die 8ame» for 
example, two type IB topoisomsnses, mcluding two Vaccinia type IB tppoisomerases, or 
can be di£Eerent, mcluding two type IB topoisomerases firom diff^ent organisms or a 
type IB topoisomerase and a type lA or a type n topoisomerase. 

In performing a method of die faivCTtion, the first and second ds nucleic add 
30 molecules are contacted under conditions such that the 5' nucleotide sequence of die 
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second nucleic acid molecule can selectively hybridize to the first 5' overhang, whereby 
the first topoisomerase can covalently link the 3* terminus of the first end of the first 
ds nucleic acid molecule to the S* terminus of the first end of the second ds nucleic acid 
molecule, and the second topoisomerase can covalently link the 3' terminus of the second 
5 end of the first ds nucleic acid molecule to tfie 5* terminus of the second end of the second 
ds nucleic acid molecule» to generate a directionally or non-directionally linked 
recombinant nucleic acid molecule. Accordhigly, die present invention provides a 
directionally or non-directionally linked recombinant nucleic acid molecule produced by 
such a method. 

10 As disclosed herein^ a method of the invention can provide a means to 

directionally link two or more ds nucleotides in a predetennined dkectional orientation. 
The term "directionally link is used herein to refer to the covalent linkage of two or 
more nucleic acid molecules in a particular predetermined ord^ and/or orientation. Thus, 
a method of the invention provides a means, for example, to covalently link a promoter 

IS expression control element upstream of a coding sequrace, and to covalentiy link a 
polyadenylation signal downstream of the coding region to generate a fimctional 
e}q>ressible recombmant nucleic acid molecule; or to covalmtiy link two coding 
sequences such that they can be transmbed and translated in fiiame to produce a fiision 
polypeptide. The temi "nonrdirectionally link " is used hsmn to refer to the covalmt 

20 linka^ of two or more nucleic acid molecules in a random order, i.e., eitho: of the first or 
second end of the nucleic acid molecule can be linked to an end of anotho: nucleic acid 
molecule. 

The term "substantially blunt," wh^ used in reference to an end of a ds nucleic 
acid molecule, means that the end can be blunt or can have a short oveifaang that does not 
25 reduce or inhibit a strand invasion event by a second nucleic acid molecule havmg an 
ov^hang. For e3ranq>le, a substantiaUy blunt aid can include an end having an o^ 
of 1 , 2, or a few nucleotides, provided flie ovediang at tiie substantially bhmt end does not 
block strand invasion. For example, tiie second dsnucldc add molecule can have a 
5' adenosine or 5' inosine overhang. 
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It should be tecognized that reference herem to a "first nucleic acid molecule," 
"second nucleic acid molecule," "third nucleic acid molecule," and the like, is used only 
to provide a means to indicate which of several nucleic acid molecules is being referred 
to. Thus, absent any specifically defined characteristic with respect to a particular nucleic 
5 acid molecule, the terms "firs^" "second," "&ird" and the like, when used in reference to 
a nucleic acid molecule, or a population or plurality of nucleic acid molecules, are not 
intended to indicate any particular order, importance or other information about flie 
nucleic acid molecule. Thus, where an exemplified me&od refers, for example, to using 
PGR to amplify a first ds nucleic acid molecule such that the anq)lification product 
10 contains a topoisomerase recognition site at one or both ends, it will be recognized that, 
sinularly, a second (or other) ds nucleic acid molecule also can be so anq>lified 

The term "at least a second," when used in reference to a ds nucleic acid molecule, 
means one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, etc.) nucleic acid molecules in 
addition to a first ds nucleic add molecule. Thus, the term can refer to only a second 

IS nucleic acid molecule, or to a second nucleic acid molecule and a third nucldc add 
molecule (or more). As sudi, the term "second (or otibt^) ds nucleic acid molecule" or 
"second (and other) ds nucldc acid molecules" is used herein in recognition of the fact 
that the term "at least a second nucldc acid molecule" can refer to a second, third or more 
nucldc add molecules. It should be recognized that, unless indicated otherwise, a nucldc 

20 acid molecule enconq^assed witiiin the meaning of the term "at least a second nucldc acid 
molecule" can be the same or substantially the same as a first nucldc add molecule. For 
exanq)le, a first and second ds nucldc acid molecule can be the same except that only one 
of the molecules, for example, the first ds nucleic acid molecule, has a topoisomerase 
recognition dte, or Gxcept for having complementary 5' overhanging sequCTces, for 

25 exanq)le, produced upon cleavage by a topoisomerase^ such that tiie first and second ds 
nucldc add molecules can be directionaUy linked using a method of tiieinvm^ As 
sudi, a metiiod of ttie invention can be used to produce a concatenate of first and second 
ds nucldc add molecules, which, optionally, can be mterq)ersed, for example, by a third 
ds nucldc add molecule such as an expression control element, and can contain the 

30 directionally linked sequences in a predetermined directional orientation, for example, 
each in a 5' to 3* orientation with respect to each otfa^. 
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It will be recogmzed that each of the ds nucleic acid molecules, for example, a 
sequence referred to as a first ds nucleic acid molecule, generally comprises a population 
of such nucleic acid molecules, which are identical or substantially identical to each other. 
Thus, it should be clear that the temi "different" is used in comparing, for sample, a first 
S (or population of first) ds nucleic acid molecules witih a second (and other) ds nucleic acid 
molecule. As used herein, the term "different," when used in reference to the ds nucleic 
add molecules of a composition of the invention, means that the ds nucleic add 
molecules share less than 95% sequmce identity with each when optimally aligned, 
generally less than 90% sequmce identity, and usually less than 70% sequrace identity. 

10 Thus, ds nucleic acid molecules that, for exanq)le, di£fer only in bdng polymorphic 

variants of each other or tiiat merely contain different 5' or 3' ovohangmg sequences are 
not considered to be "different" for puiposes of a conq>osition of the invention. In 
comparison, differCTit ds nucleic acid molecules are exemplified by a first sequence 
encoding a polypq>tide and second sequence comprising a expression control element, or 

15 a first sequence encoding a first polypq)tide a second sequrace encoding a 
non-homologous polypq)tide. 

The term "recombinant" is used herein to refer to a nuddc acid molecule that is 
produced by linking at least two nucldc acid molecules. As sudi, a recombinant nucleic 
acid molecule encompassed within or generated according to a method of the invention is 

20 distinguishable fi:om a nucldc acid molecule that may be produced in nature, for 

example, during mdosis. A recombinant nucldc acid molecule gmerated according to a 
method of die invention can be identified, for example, by tiie presence of die 
complementary nucldc acid sequence in close proximity, generally directiy adjacmt, and 
usually diiecdy 3', to a tqx)isomecase binding dte in a double stranded nucleic acid 

25 molecule. 

As disclosed herdn , a method of die invention can be used to diiectionally or 
non-diiectionally link a first ds nucldc add molecule to a second ds nucldc acid 
molecule. In nianyemlx)diments, the method may be used to directionaUylin^ 
nucldc acid molecule and a second (or other) ds nucldc acid molecule. However, use of. 
30 the mediod to non-dnectionally hnk a first ds nucldc add molecule and a second (or 



wo 02/16594 



FCT/USOl/26294 



40 

Other) ds nucleic acid molecule also provides advantages. Non-directional linking can be 
performed, for example, 1) where a second nucleotide sequence is present at or near the 
5' terminus of the second end of the first ds nucleic acid molecule, which can form all or 
part of a second overhangs and is cspsble of hybridizing to die 5' conq)lemffltary 
S nucleotide sequmce of tiie second ds nucleic acid molecule; and 2) where a nucleotide 
sequence is present at or near the 5* terminus of boHi Ike first end and Ike second md of 
the second ds nucleic acid molecule that is cq)able of hybridizmg to fhe S* oveihang of 
the first end of fiie first ds nucleic acid molecule. In these embodimmts, the second end 
of the first ds nucleic acid molecule and the second end of the second ds nucleic acid 
10 molecule can be either blmit, or include an oveihang. 

In another embodiment, a method for generating a directionally or 
non*<]irectionally linked recombinant nucleic acid molecule can be performed, for 
example, by contacting a first precursor ds nucleic acid molecule having a first end, 
which has a first 5' target sequence at the S* terminus and a topoisomerase recognition site 

IS at or near the 3' terminus, and a second end, which has a topoisomerase recognition site at 
or near the 3* terminus; a second ds nucleic acid molecule having a filrst blunt end and a 
second end, wherein the first blunt end has a 5* nucleotide sequence complementary to the 
5' target sequmce of the first precursor ds nucleic acid molecule; and a topoisomerase that 
is specific for the topoisomerase recognition site. As used herein, reference to a 

20 "precursor" ds nucldic add molecule means a ds nucldc acid molecule that contains a 
topoisomerase recognition site and that, upon cleavage by a topoisomerase specific for 
fhe recognition site, produces an end having a desired 5' terminal nucleotide sequence, 
3* terminal nucleotide sequence, or bofli. Such a desired end is produced, in part, due to 
fhe presence of the 5* target sequence, vMch, vipon cleavage of fhe ds nucleic acid 

25 molecule contaming fhe 5' target sequence, is converted to a 5* nucleotide sequence that 
allows directionally linkmg fhe ds nucleic acid molecule to a second ds nucleic add 
molecule. 



30 



Accordiiig to a meOiod of fhe mvention, the first ds nucldc acid molecule, second 
ds nucldc add molecule and topoisomerase are contacted under conditions that allow 
topoisomerase activity, i.e., sudi that flie topoisomerase can bind to and cleave the 
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recognition site, to produce a topoisomerasc-charged 3' terminus, and can ligate the 
3' terminus to an qjpropriate 5' terminus. Such conditions also allow hybridization of the 
portion of the portion of the first 5* target sequmce ttiat remains following cleavage by 
the topoisomerase and the S' nucleotide sequence of the second ds nucleic acid molecule 
5 ttiat is complementary to that portion of the 5' target sequence. 

In performing a method of the invention, a precursor ds nucleic acid molecule can 
be combined in the same reaction vessel and at the same time with the topoisomerase and 
the second ds nucleic acid molecule before the preciusor ds nucleic acid molecule is 
converted into a topoisomerase-charged ds nucleic acid molecule that can be directionally 
1 0 linked to a second ds nucleic acid molecule. By combining the topoisomerase in the same 
reaction vessel as the precursor ds nucleic acid and the second nucleic acid, the methods 
of the present invention are simplified Altematively, a first precursor ds nucleic acid 
molecule can be combined with topoisomerase under conditions that allow topoisomerase 
cleavage and binding, thai a second ds nucleic add molecule can be added. 

15 A precursor ds nucleic acid molecule can be linear or circular, including 

supercoiled, and, as a result of cleavage by one or more topoisomerases and, if desired a 
restriction endonuclease, a linear topoisomerase-charged first ds nucleic acid molecule is 
produced For example, a cuxnilar ds nucleic acid molecule containing two type IB 
topoisomerase recognition sites within about 100 nucleotides of each other and in the 

20 complementary strands, preferably within about twenty nucleotides of each other and in 
the complementary strands, can be contacted with a site specific type IB topoisomerase 
such that each strand is cleaved and the intervening sequence dissociates, diereby 
generating a Imear ds nucleic acid molecule hamg a topoisomerase covalentiy bound to 
each end 

25 In general, a topoisomerase-charged double stranded nucleic acid, which can be 

directionally linked to a second or other ds nucleic acid molecule, is generated by 
contacting topoisomerase with a precursor ds nucleic acid that has at least one 
topoisomerase recognition site at or near one end and a first target sequence. Asused 
herein, tiie tram "topoisomerase recognition site" means a defined nucleotide sequence 

30 that is recognized and bound by a site specific topoisomerase. For example, the 
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nucleotide sequence 5'-(C/T)CCTr-3' is a topoisomerase recognition site that is bound 
specifically by most poxvirus topoisomerases, including Vaccinia virus DNA 
topoisomerase I, whidi then can cleave the strand after the 3'-most thymidine of the 
tecogoition site to produce a nucleic acid molecule conqyrising 
5 5'-(Cyr)CCTT-P04-TOPO, i.e,, a complra of the topoisomerase covalently bound to the 
3' phosphate through a tyrosine residue in the topoisomoase (see Shuman, X Biol. Chem. 
266:11372-11379, 1991; Sekiguchi and Shuman. NucL Acids Res. 22:5360-5365. 1994; 
each of which is incorporated herem by reference; see, also, U.S. Pat. No. 5,766,891; 
PCT/US95/16099; PCT/US98/12372). 

10 An advantage of constructing a precursor ds nucleic add molecule to comprise, 

for exanq>le, a type IB topoisomerase recognition site about 2 to 15 nucleotides (e.g., 2, 3, 
4, 5, 6, 7, 8, 9, 12, 14, 15, or 20 nucleotides) fix>m one or both ends is tiiat a 5' ovezhang is 
generated following cleavage of tiie ds nucleic acid molecule by a site specific 
topoisomoase. Such a 5' overhanging sequence, which would contain 2 to 

15 20 nucleotides, respectively, can be designed using a PGR mettiod as disclosed herein to 
have any sequence as desured. Thus, where a cleaved first ds nucleic acid molecule is to 
be directionally linked to a selected second (or other) ds nucleic acid molecule according 
to a method of the invention, and where the selected sequence has a 5' overhanging 
sequence, the 5* ovohang on the first ds nucleic acid molecule can be designed to be 

20 complementary to the 5 * overhang on the selected second (or other) ds sequence such tiiat 
the two (or more) sequences are directionally linked in a predetermined orientation due to 
the complementarity of the 5' overhangs. As discussed above, similar methods can be 
utilized with respect to 3' ovwhanging sequences gmerated ij^Jon cleavage by a type lA or 
type n topoisomoase. 

25 As used herem, the t^m "cleavage product," vrbsa used in re&smco to a 

topoisomerase recognition sit^ refm to a nucleic add molecule that has been cleaved by 
a topoisomerase, generally at its recognition site, and conqmses a conoplex of the 
topoisomerase covalmtiy bound, in ttie case of a type IB topoisomerase to the 
3' phosphate groiq> of tiie 3' terminal nucleotide in the topoisomerase recognition site, or, 

30 in tiie case of type lA or ^e n topoisommse, to the 5' phosphate ffoxsp of tiie 
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5' tenuiial nucleotide in Aetopoisomera^ Such a complex, i;iiiich 

comprises a topoisomeiase cleaved ds nucleic add molecule having the topoisom^tase 
covalmtly bound Hx&coto, is lefmed to h^^ as a "topoisomerase-activated" or a 
"topoisomerase-chaiged" nucleic acid molecule. Topoisomerase-activated ds nucleic acid 
5 molecules can be used in a method of the invention, as can ds nucleic acid molecules that 
contain an uncleaved topoisomerase recognition site and a topoisomerase, wherein the 
topoisomorase can cleave the ds nucleic acid molecule at the recognition site and become 
covalendy bound thereto. 

As will be readily ^parent from the present disclosure, the ends of ds nucleic add 

10 molecules to be linked according to a method of the invmtion can have various 

characteristics. For example, m one aq>ect, the second end of a first precursor ds nucldc 
acid is a blunt end upon cleavage by the tqpoisomoase, and the second end of a second ds 
nucleic acid molecule is a blunt end. In another aspect, the second end of a first precursor 
ds nucleic acid molecule has a 3* thymidine octension iqpon cleavage by the 

1 5 topoisomerase, and the second end of a second ds nucldc acid molecule has a 

3' adenosine overiiang, or the termini can comprise 3* adenosme and 3' deoxyuridine 
overhangs (see U.S. Pat Nos. 5,487,993 and 5,856,144, each of i^ch is incorporated 
herdn by reference). In yet another aspect, a first precursor ds nucldc add molecule has 
a second 5' target sequmce at the second end, and the second ad of a second ds nuddc 

20 add molecule has a 5* nucleotide sequence conq>lem6ntary to at least a portion of the 
second 5* target sequence. The first precursor ds nucldc acid molecule can be a vector, 
including a cloning vector and an expression vector, and, where generally present in a 
circular form, can be linearized due to the action of the topoisomerase, or can be 
linearized by mcluding, fixr example, one or two restriction endonucleases that linearize 

25 the vector such that, \xpon contact with die topoisomerase, the first and second ds nucleic 
add molecules can be dnectionally linked accordmg to a method of the invention. 

As used herem^ the term "at or near;" vfhea used m reference to the proxunity of a 
topoisomerase recognition dte to the 3' (^e IB) or 5* (type lA or type O) terminus of a 
nucldc acid molecule, means that the site is widun about 1 to 1 00 nucleotides fit>m die 
30 3' terminus or 5' terminus, respectively, generally withm about 1 to 20 nucleotides bom 
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the tenninus, and particularly within about 2 to 12 nucleotides fix)m the req)ective 
terminus. An advantage of positioning flie topoisoxnerase recognition site within about 10 
to 1 S nucleotides of a terminus is that, upon cleavage by the topoisomerase» &e portion of 
the sequence downstream of ttie cleavage site can spontaneously dissociate fiom the 
5 remaining nucleic acid molecule, which contains the covalently bound topoisomerase 
(referred to generally as "suidde cleavage"; see, for example, Shuman, stq^ra^ 1991; 
Andersen et aL, supra^ 1991). Where a topoisomerase recognition site is greater than 
about 12 to IS nucleotides fixmi the terminus, the nucleic acid molecqle iqpstream or 
downstream of the cleavage site can be induced to dissociate fiom the remamder of fhe 
10 sequence by modifying fhe reaction conditions, for example, by providing an incubation 
stq> at a t^nperature above the melting tenq>eraturB of fhe portion of (he diq>Iex inchiding 
the topoisomerase cleavage site. 

A method of the invention using a first precursor ds nucleic add molecule having 
a S' target sequence and a topoisomerase on a first end and a second end, can be 

1 S performed to directionally or non-directionally link a first precursor ds nucleic acid 
molecule to a second ds nucleic acid molecule. The method typically is used to 
dnectionally link the first precursor ds nucldc add molecule and fhe second ds nucleic 
add molecule, and also can be used to non-dkectionally link fhe first precursor ds nucldc 
acid molecule and the second ds nucldc acid molecule. Non-directional linking can be 

20 performed, for exanq)le, 1) where a second nucleotide sequence is present at or near the 
tmninus of fhe second end of fiie first precursor ds nucldc add molecule, that is 
ciq>able of hybridizing to the 5' complemCTtary nucleotide sequence of the second ds 
nucldc acid molecule; and 2) where a nucleotide sequence is present at or near the 
5* tmninus of both fhe first mA, and flie second end of fiie second ds nucldc acid 

25 molecule fliat is cq)able of hybridizing to the 5' target sequences at or near Ae first end of 
fhe first precursor ds nucldc add molecule. In these embodiments, the second end of fhe 
first precursor ds nucldc add molecule and the second end of flie second ds nucldc acid 
molecule can be dther blunt, or include an overhang. 

A method for generatmg a directianaUy or non-duiectionaUy Im^ 
30 nucldc add molecule also can be poformed by contacting a topoisomerase-charged first 
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ds nucleic add molecule, which has, at a first end, a first 5' ov^ang and a fiist 
topoisomerase covalently bound to the 3* terminus, and a second ds nucleic add 
molecule, which has a first blunt md and a second end, herein the first blunt end 
includes a 5' nucleotide sequence complementary to the first S' ovo^iang. The method is 
5 perfomied under conditions such tfiat die S' nucleotide sequence of fiie first blunt end can 
selectively hybridize to the first S' overhang, whereby the first topoisomerase can 
covalently link the 3' terminus of the first end of the first ds nucldc add molecule with 
the S' terminus of the first end of the second ds nucldc acid molecule. 

Such a method can fiuiher mclude contacting the topoisomerase-charged first ds 

10 nucldc acid molecule and the second ds nucldc acid molecule with a third ds nucleic acid 
molecule, wherein a first end of the third nucleic ds add molecule has a S' oveifaang and a 
second topoisomerase covalently bound at the 3' terminus, and wherein flie second ds 
nucleic add molecule has a second blunt end, ^ch includes a 5' nucleotide sequence 
complementary to the 5' ovetfaang of the third nucleic add molecule. Hie contacting can 

15 be performed, for sample, under conditions such that the 5' nucleotide sequence of the 
second blunt end of the second ds nucleic acid can selectively hybridize to the 
5* overhang of the first end of the third ds nucleic acid molecule, whereby the second 
topoisomerase can covalentiy hnk die 3* terminus of the first end of die third ds nucldc 
acid molecule witii die 5' tmninus of die second blunt end of the second ds nucldc acid 

20 molecule. The first and second topoisomerases can be the same or difEerent and, if 
desired, the first or third ds nucldc add molecules, instead of bdng topoisomeras&- 
diarged, can contain a topoisomerase recognition site, wh^tdn the meiiiod can fiirth^ 
include contactmg the reactantswidi a topoisomerase. A metiiodoftiie invention can be 
performed such that the first ds nucldc acid molecule is dnectionally linked to tiie second 

25 ds nucldc acid molecule and, tiiareaft^, the thkd ds nucldc acid molecule is directionally 
or non-dkectionally linked to the second ds nucldc add molecule, or all of the reactants 
can be included to^tiier at the same time. 



A mediod of tibie invmtion provides a means to render an open mading from a 
cDNA or an isolated genomic DNA sequmce expressible by operatively linking one or 
30 more e(xpression control elements to the putative coding se^ Bxamplesof 
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^ression control elements usefiil in the present invention are disclosed herein and 
include transcriptional expression control elements, translational e7q>ressi0n control 
elements, elements that facilitate the transport or localization of a nucleic acid molecule 
or polypq>tide in (or out of) a cell, elements that confer a detectable phenotype, and the 
S like. Transcriptional expression control elements include, for example, promoters such as 
fliose from cytomegalovirus, Moloney leukemia virus, and herpes virus, as well as those 
fiom die gmes encoding metallofhionein, skeletal actin, phosphoenolpyruvate 
caiboxylase, phosphoglycerate, dihydrofolate reductase, and Aymidine kinasci, as well as 
promoters from viral long terminal repeats (LIKs) such as Rous sarcoma virus LTR; 

10 enhancers, which can be constitutively active such as an immunoglobulin enhancer, or 
inducible such as S V40 enhancer, and the like. For example, a metallothionein promoter 
is a constitatively active promoter that also can be induced to a higher level of expression 
upon exposure to a metal ion such as copper, nickel or cadmium ion. In comparison, a 
tetracycline (tet) inducible promoter is an example of a promote that is induced upon 

15 exposure to tetracycline, or a tetracycline analog, but otherwise is inactive. 

A transcniptional expression control element also can be a tissue specific 
expression control elem^t, for example, a muscle cell specific expression control 
element, such that expression of an encoded product is restricted to the muscle cells in an 
individual, or to muscle cells in a mixed population of cells in culture fbr example, an 

20 organ culture. Muscle cell specific expression control elements including, for exanq)le^ 
the muscle creatine kinase promoter (St^nberg et al., MoL Cell Biol. 8:2896-2909, 1988, 
which is incorporated beaxm by refa[irace) and the myosin ligjbt chain enhancer^promotor 
OOono^e et al., Proc, Natl Acad. Sci.. USA 88:5847-5851, 1991, which is mcoiporated 
ha:ein by reference) are well known in flie art Oth^ tissue specific promoters, as well as 

25 expression control elements only expressed during particular developmental stages of a 
cell or organism are well known in the art. 

Expression control or other elements usefiil m generating a construct accordmg to 
a method ofthe invention can be obtained in various ways. In particular, many of the 
elements are included in commercially available vectors and can be isolated therefiom 
30 and can be modified to contain a topoisomerase recognition site at one or both ends, for 
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example, using a PGR method as disclosed hetmxL la addition, the sequences of or 
encoding the elements usefiil herein generally are well known and disclosed in 
publications. In many cases, the elements, for example, transcriptional aiid transladonal 
expression control el«ients, as well as cell comparbnentalization domains, are relatively 
S short sequences and, therefore, are amenable to chmical synthesis of the element or a 
nucleotide sequence encoding fhe element Tlius, in one embodiment, an elemrat 
comprising a composition of fhe invention, usefiil in generating a recombinant nucleic 
add molecule according to a method of the invention, or included within a kit of the 
invention, can be chemically syn&esized and, if desired, can be synthesized to contain a 
1 0 topoisomerase recognition site at one or both ends of the elemmt and, further, to contain 
an ovetfaangmg sequence following cleavage by a site specific tppoisomoase. 

Where ds nucleic acid molecules are to be directionaUy linked according to a 
method of the invention, the nucleic acid molecules gmsacaUy are operatively linked such 
that the recombinant nucleic acid molecule that is generated has a desked structure, 

IS perfonns a desned Amotion, encodes a desired expression product, or fhe As used 
herdn, the temi "op^atively Imked" means diat two or more nucleic acid molecules are 
positioned with respect to each other such that tiiey act as a unit to effect a function 
attributable to one or both sequences or a combination thereof. For example, a nucleic 
• add molecule containing an open reading fiame can be operatively linked to a piomotCT 

20 such that ttie promoter confers its regulatory effect on fhe open reading firame similarly to 
the way in ^^ch it would effect ejqiression of an open reading fiame that it nomially is 
associated with in a genome in a celL Similarly, two or more nucleic acid molecules 
conqxrising open reading fiames can be operatively linked in firame such that, upon 
transcrqition and translation, a chimeric fusion polypeptide is produced. 

25 A first ds nucldc acid molecule comprising an open reading fi:ame can be 

amplified using any amplification method, for example, by PGR using a primer pair, to 
gmerate an amplified first ds nucldc acid molecule having a S* nucleotide sequence 
conq>lementary to a 5' overhang of a topoisommse-charged ds nucldc acid molecule of 
the present invention. Where botfi ends oftfae amplified first ds nucleic add molecule are 

30 complonents of two ovediangs on fhe topoisomerase-chaiged ds nucldc add molecule, 
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the 5' ovediaiigs are different fiom each other. The amplified first ds nucleic acid 
molecule then can be contacted with the topoisomerase-charged ds nucleic add molecule 
comprising a desired expression control element such as a promoter such that the second 
nucleic acid molecule is opa:atively Imked to the 5' end of the coding sequence according 
S to a method of the invention. 

Various combinations of components can be used in a method of the invention. 
For example, the method can be performed by contacting a topoisomorase-activated first 
ds nucleic acid molecule; a second ds nucleic acid molecule having a first end and a 
second end, wh0:em at the first end or second ead or both, the second nucleic acid 

10 molecule has a hydroxyl gcoi^ at the 5* temiinus of the same and; and a S' oveifaang. 
Where fiie S ' terminus of one or both ends to be linked has a 5 ' phosphate groiq>, a 
phosphatase also can be contacted with the conq)onents of the reaction mixture. U^n 
contacting, fiie phosphatase, if necessary, can generate a 5* hydroxyl groiqp at the same 
end, and the second ds nucleic acid molecule th^ can be directionally linked to the 

IS tppoisomerase-activated first ds nucleic acid molecule. The skilled artisan will recognize 
other combinations of components usefiil for p^orming a metiiod of the invention. 

As used herein, Tefercnce to contacting a first nucleic acid molecule and a second 
nucleic acid molecule *'und^ conditions such that all conq)onents are in contact" means 
that the reaction conditions are appropriate for a topoisomerase-cleaved end of a first ds 

20 nucleic acid molecule to come into sufQcient proximity sudi tiiat a topoisomerase can 
effect its ^usymatic activity and covalentiy link the 3' terminus of the first ds nucleic acid 
molecule to a 5* hydroxyl groiqp at the tenninus of a second nucleic acid molecule. 
Bxanq;)les of such conditions include, fi>r exan^le, the reaction temperature, ionic 
strength, and pH. Additionally, such conditions can be determined estqpirically or using 

25 formulas that predict conditions for ^edfic hybridization of nucleic add molecules, as is 
wen known in the art (see, for example (Sambrook et aL, Molecular C loning! A 
lahgratATYinflniifll (Cold Spring Haibor Laboratory Press 1989); Ausubel et aL, Current 
Protocols in Molecular Biology. John Wiley and Sons, Baltimore, MD (1987, and 
supplements tiuou^ 199S), each of ^ch is incorporated herein by reference). 
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As disclosed herein, a PGR method using primens designed to mcx)rporate 
oonq>lemeataiy nucleotide sequences at one or both ends of an anqplified ds nucleic acid 
molecule provides an example of a convenient means for producing ds nucleic acid 
molecules useful in a method of the invCTlion. At least one of the primets of a pnmer 
5 pair is designed such that it comprises, in a 5' to 3' orientation, a nucleotide sequence 
complemGDtary to a first overhang on the topoisomerase-charged ds nucleic acid molecule 
of the preset invention* The second primCTofthe PGR primer pair can be 
complementary to a desu*ed sequence of the nucleic acid molecule to be empV&ed, and 
can comprise a second complementary sequence. 

10 A primer can contain or encode any other sequence of interest, including, for 

example, a site specific integration recognition site such as an att site, a lox site, or the 
like, or, as discussed above, can simply be used to introduce a topoisomerase recognition 
site into a dsnucldc acid molecule comprising such a sequence of interest A 
recombinant nucleic acid molecule generated according to a metiiod of the inv^tion and 

1 5 containing a site specific integration recognition site such as an att site or lox site can be 
integrated specifically into a desired locus such as into a vector, a gene locus, or the like, 
tiiat contains the required integration site, for example, an att site or lox site, respectively, 
and vpaa contact with the appropriate enzymes required for the site specific event, for 
example, lambda Lit and lEIF proteins or Cre recombinase, respectively The 

20 incorporation, for exdxaplc, of aUB or attP sequences into a directionally or non- 
directionally linked recombinant nucleic acid molecule according to a method of the 
invention allows for the convenient manqndation of the nucleic acid molecule usmg the 
GAIEWAY™ Cloning System CbvitrogenOnp., La JoUaCA). A fioistds nucleic add 
molecule used in a method of the invention can be a linearized vector containing two site 

25 specific integration sites, for exanq)le, an "entry vector" (GATEWAY™ Clpnmg System), 
and a method of the invention can be used to insert a second (or other) ds nucleic acid 
molecule between the site specific integration sites. 



A ds nucleic acid molecule to be used in a mdhod or kit of the invention can be 
amplified using any amplification reaction, for example, using tiie polymerase chain 
30 reaction (PGR), to contain a coinqplementary nucleotide sequence at a S* end. Although 
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exemplified by PCR, other amplification methods also can be used to amplify a nucleic 
acid molecule such that the amplified nucleic acid molecule has a complementary 
sequence at ttie S' tominus of one of its ends. The complementary nucleotide sequence is 
complementary to the 5' oveifaang on the topoisomecas^-charged ds nucleic acid to which 
5 tiie amplified nucleic acid will be ligated This complementarity facilitates die 

association of the nucleic add molecules m a predetermined orientation, whereupon they 
can be linked by topoisomerase according to a method of the invention. 

Amplification primers can be designed to impart particular characteristics to a 
desired ds nucleic acid molecule, for example, a ds nucleic acid molecule that encodes a 
10 transcriptional or translational expression control elemoit or a coding sequrace of interest 
such as an qpitope tag or cell compartmentalization domain, hi this aspect, the 
amplification primers are designed such that, iq>on amplification, the ds nucleic add 
molecule contains a 5' complementary sequrace at one or both ends, as desired. 

Amplification primers also can be used to amplify a directionally linked 
recombinant nucleic acid molecule generated according to a method of the invention. For 
example, a method of the invention can generate &om tiuee ds nucleic acid molecules, 
including a nucleic acid molecule comprising a promoter, a nucleic acid molecule 
comprising a coding sequence, and a nucleic add molecule comprising a po]yaden}4ation 
signal, an e?q)ressible recombinant nucldc acid molecule. The gen^tion of the nucldc 
add molecule is fecilitated by the incorporation of complementary 5' (or 3') sequences at 
the ^ds of the ds nucleotides sequences to be joined, wherdn preferably one of the 
complementary sequences is an ov^iiang sequence. 

As such, by designing a PGR primer pair containing a first primer that is specific 
for an overiiang of the nucleic acid molecule comprising the promote that is upstream 
25 fix>m the promoter, and a second primer that is specific for an overhang of the nucldc acid 
molecule comprising the polyad^^ation signal that is down stream of the signal, only a 
fiill lengtii fimctional recombitiant nucldc molecule containing the promoter, coding 
sequence and polyadenylation signal in the correct (predetermined) orientation will be 
amplified. In particular, partial reaction products, for sample, containmg only a 
3 0 promote: linked to the coding sequence, and reaction products contammg nicks are not 
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amplified Thus, PGR can be used to specifically design a ds nucleic acid molecule such 
that it is usefiil in a method of the invention, and to selectively ampMy only those 
reaction products having the deshed components and characteristics. 

A method of the invention can be performed such that a second ds nucleic acid 
S molecule to be directionally ligated to a first ds nucleic acid molecule, is one of a 

plurality of second ds nucleic acid molecules. As used herein, the term "plurality," when 
used in reference a first or at least a second nucleic acid molecule, means that &e nucleic 
acid molecules are related but different For purposes of the presmt invention, the 
nucleic acid molecules of a plurality are "related" in that each nucleic acid molecule in the 

10 plurality contains, for example, a 5* nucleotide sequence that is complementary to a 
5' oveiiiang sequence present on a topoisomerase-charged ds nucleic acid molecule to 
which the second ds nucldc add molecules are to be directionally linked Ftulhermore, 
die nucleic acid molecules of a plurality are "dififerrat" in that ihey can comprise, for 
example, a cDNA library, a combinatorial library of nucleic acid molecules, a variegated 

1 5 population of nucleic add molecules, or die like. Methods of making cDNA libraries, 
combinatorial libraries, libraries comprismg variegated populations of nucldc add 
molecules, and the like are well known in die art (see, for example, U.S. Pat No. 
5,837,500; U.S. Pat No. 5,622,699; U.S. PaL No. 5,206,347; Scott and Smith, Sdmce 
249:38^390, 1992; Markland et al., G^oe 109:13-19, 1991; O'Connell et aL, ProcNatl, 

20 Acad Sci.. USA 93:5883-5887, 1996; Ttaedc and Gold, Sdence 249:505-510, 1990; Gold 
et aL, Ann. Rev. Biochem. 64:763-797, 1995; each of vMch is incorporated herem by 
reference). 

Where a second ds nucldc add molecule is one of a population of ds nucldc acid 
molecules, a method of the invention can further utilize a population of first ds nucldc 

25 add molecules, eadi of which contains a different and randomly genoated nucleotide 
sequence at or near a 3* and/or 5' terminus of a first and/or second rad, for exanq>le, 
randomly generated 3' or 5* ovediangs at ornear a first end Sudi a population of 
randomly generated nucleotide sequences near an end will include conq>lementary 
sequences to nucleotide sequences at or near die ends of many or all of the second ds 

30 nucleic add molecules of die plurality. Thus, the mdfaod can be used to generate liilked 
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recombinant nucleic acid molecules, including many or all of the nucleic add molecules 
in the plurality of second ds nucleic acid molecules. 

The methods of the invention have broad application to the field of molecular 
biology. As discussed m greater detail below, flie methods of the invention can be used, 
5 for example, to label DNA or RNA probes, to generate sense or antisoise RNA, to 
prepare bait or prey constructs for performing a two hybrid assay, to prqiare linear 
expression elements, to prqiare constmcts useful for coupled in vitro 
transcription/translation assays, and to perform directional cloning. 

A directionally or non<-directionally linked recombinant nucleic acid molecule 
10 gena:ated according to this aspect of the invention can be linear, but preferably is circular, 
particularly a vector, as described above. The circular recombinant nucleic acid molecule 
can be generated such that it has the characteristics of a vector, and contains, for example, 
expression control elements reqxiired for replication in a prokaryotic host cell, a 
eukaryotic host cell, or both, and can contain a nucleotide sequence encoding a 
IS polypeptide that confers antibiotic resistance or the like. 

A method of the invention can furtfao- include introducing a directionally or non- 
directionally linked recombinant nucldc acid molecule into a cell, which can be a 
prokaryotic cell such as a bacterium <n: a oikaryotic cell such as a fnammftliflu cell. 
Accordingly, the present invention also provides a cell produced by a method of the 

20 invention, as weU as a non-human transgenic organism produced fixmi such a cell. An 
advantage of such a metihod is fbat the generated recombinant nucleic acid molecule, 
which is circularized according to a method of the invention, can be transformed or 
transfected into an ^propriate host cell, wherein the construct is amplified. Thus, an in 
vivo method using a host cell can be used for obtaining a laige amount of a circularized 

25 product generated according to a method of the invention. 

It should be recognized that a method of the invention is diaracterized, in part, in 
that a linked recombinant nucleic acid molecule geuerated fliereby either contains a nick 
m one strand, because a topoisomerase is attached to only one 3*terminus of the ends to be 
linked, or conqirises one strand that is derived completely bom only one of two nucleic 
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acid molecule linked according to fhe method. Wh^e the recombinant nucleic acid 
molecule contains a nick, the nick can be converted to a phosphodiester bond, if desired, 
for example, by contacting the nicked recombinant nucleic acid molecule witti a DNA 
ligase, by introducmg the nicked recombinant nucleic acid molecule into a cell such as a 
S bacterium that can rq>air the nick, or by any oflier method as desired. Thus,inone 
embodiment, a mediod of the inv»tion includes a strand invasion event and a ligation. 

Where a recombinant nucleic acid molecule generated according to a method of 
the invention does not comprise one strand tibat is derived completely fiiom only one of 
the startmg nucleic acid molecules, fhe method can further include a cleavage stq), 

10 wherein the displaced nucleotide sequCTce is cleaved fiom the product Such a cleaving 
step can be performed using any me&od knowa to cleave or degrade a single stranded 
nucleotide sequence, mcluding, for example, contacting a recombinant nucleic acid 
molecule comprising the displaced strand with an enzyme having, S* to 3' or 3' to S* single 
stranded nucleic acid exonuclease activity (dqiendii^ on the orientation of the displaced 

15 strand). Such a method convenientiy can be performed mvi£ro. Alternatively, the 

recombinant ds nucleic acid molecule can be introduced into a cell» for example an E. coli 
cell, wherein tiie displaced nucleotide sequmce is cleaved. 

A method of tiie inv^tion can be used to generate a directionally linked 
recombinant nucleic acid molecule encoding a chimoic fusion polypeptide. For 

20 generating such a recombinant nucleic acid molecule, a first and second (or ofh^) 

ds nucleic acid molecule each can encode all or a portion of an opoa reading frame, and 
the first and second (or other) ds nucleic acid molecules, which have first and/or second 
ends as disclosed herein, are directionally linked. The chimeric polypeptide can comprise 
a fiision polypqytide, in which the two (or more) encoded peptides (or polypeptides) are 

25 translated into a smgje product, Le., the peptides are covalmtly linked throng a peptide 
bond. 

For example, a first ds nucleic acid molecule can encode a cell 
compartmentalization domain, such as a plasma mmibrane localization domain, a nuclear 
localization signal, a mitodiandrial mmibrane localization signal, an endoplasmic reticulum 
3 0 localization signal, or the like, or a protein transduction domain such as tfie human 
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inumniodeficieacy virus TAT protdn transduction doman], vAAch can fedlitate translocation 

of a pq)tide linksed tfiCTeto into a cell (see SSchwaize et al., Sdmce 285:1569-1572, 1999; 

Derossi ^ al., J, BioL Oxeai. 271:18188, 1996; Hancock et al., BMBO J, 10:4033-4039, 

1991; Buss et al., Mol Cell. BioL 8:3960-3963, 1988; US. Pat No. 5,776,689 each of 
5 which is incoiporated herein by refia:ence). Snchadomaincanbeiise&ltotaig^afusion 

polypq)tide conqnising the domain and a polypeptide encoded by a second ds nucleic add 

molecule, to ^ch it is directionally linked according to a method of flie invention, to a 

particular compartment in the cell, or fi)r secretion fiom or entry into a cell. Assuch,tiie 

invmtion provides a means to gmerate directional^ linked recombinant nucldc add 
10 molecules that encode a diimaic polypeptide. 

A fusion polypq>tide expressed from a directionally linked lecombinant nucldc 

acid molecule generated according to a metiiod of tiie invmtion also can comprise a 

pq>tide having the characteristic of a detectable label or a tag sudi tiiat tiie e3q)ressed 

fusion polypeptide can be detected, isolated, or the like. For exanq)le, a first, second or 
15 other ds nucldc acid molecule containing a topoisomerase recognition site, or cleavage 

product fbsiGof^ as disclosed herein, can encode an enzyme sudi as alkaline phosphatase, 

d-galactosidase, chloramphenicol acetyltransferase, ludferase, or other enzyme; or can 

encode a peptide tag such as a polyfaistidine sequence (e.g., hexahistidine), a V5 epitope, 

a c-myc epitope; a hemagglutinin A q>itope, a FLAG epitope, or tiieh^ Expression of a 
20 fusion polypq>tide comprising a detectable label can be detected using the appropriate 

reagent, for example, by detecting light emission vpon addition of ludferin to a fusion 

polypeptide comprising ludferase, or by detecting binding of nickel ion to a fusion 

polypeptide comprising a polyfaistidine tag. 

Apolyfaistidine tag can conq)rise fix>m about two to about ten contiguous histidine 

25 residues (eg., two, three, four, five, six, seven, d^t, nine, or ten contiguous histidine 
residues). The tag can also be a pqitidet^ which binds nidcel ions, as weU as other 
metal ions (e.g., copper ion), and can be used for metal chelate affinity chromatography. 
Examples of such tags include peptides having the fonnula: Ri-CHis-X)^-!^^ wherem 
(His-X)n represents a metal chelating peptide and n is a number between two fluDugh ten 

30 (e.g., two, tiiree, four, five, six, seven, eigfht, nine, or ten), and X is an amino add selected 
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fiom the groiq) consisting of alanine, arginine, aspartic add, agparagine, cysteine^ 
glutamic acid, glutamine, glycine, histidine, iso-leudne, leucine, lysine, mefhionine, 
phenylalanine, proline, serine, threonine, tryptophan, tyrosine and valine. Further, R2 
may be a polypeptide which is covalently linked to the metal chelating pq>tide and Rl 
5 may be either a hydrogen or one or more (e.g., one, two, three, four, five, six, seven, 
eight, nine, ten, twenty, thirty, fifty, sixty, etc.) ammo add residues. In addition, RI may 
be a polypeptide which is covalently linked to fiie metal chelating pqptide and R2 may be 
dther a hydrogen or one or more (e.g,, one, two, three, four, five, six, seven, eight, nine, 
teiu twenty, thirty, fifty, sixty, etc.) amino add residues. Tags of this nature are described 
10 in U.S. Pat No. 5,594,1 1 5, the entire disclosure of which is incorporated herein by 
reference. 

Similarly, isolation of a fusion polypeptide comprising a tag can be performed, for 
example, by passing a fusion polypeptide comprising a myc q>itope over a column having 
an anti-c-myc qpitope antibody bound thereto, then eluting tiie bound fusion polypeptide, 

15 or by passing a fusion polypq>tide con^rising a polyhistidine tag over a nickel ion or 
cobalt ion afSnity column and eluting the bound fiision polypeptide. Methods for 
detecting or isolating such fusion polypq)tides will be well known to tiiose in the art, 
based on tiie selected detectable label or tag (see, for example, Hopp et al., 
BioTechnology 6:1204, 1988; U.S. Pat No. 5,011,912; each of which is mcorporated 

20 herein by reference). 

In one mbodiment, the directionally linked recombinant nucldc add molecules 
encode diimeric polypeptides useful for performing a two hybrid assay, in such a 
me&od, the first ds nucldc add molecule encodes a polypqitide, or a relevant domain 
thereof tiiat is suspected of having or being examined for the ability to mt^:act 

25 specifically with one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, etc.) other polypeptides. 
The second ds nucldc add molecule, to which the first ds nucldc add molecule is to be 
directioiudly finked according to a method of the uoLveotion, can encode a transcription 
activation domain or a DNAbindmg domain. For example, a first ds nucldc add 
molecules to be dhectionally linked is modified, for example, to contain a 5' ovedumg on 

30 a first end and a topoisomerase recognition site, or cleavage product thereof at or near the 
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first end A second dsnucldc acid molecules to be linked (M)ntaii^ 
contain, a 5' sequrace conq>lCTientary to (he S' ovecfaang at the first end of Ibe first 
ds nucleic acid molecule. Upon contact of the first and second ds nucleic add molecules 
with a topoisomerase, the directionally Imked nucleic acid molecule encodes a first 
5 hybrid usefiil for performing a two hybrid assay (see, for exanq)le. Fields and Song, 
Nature 340:245-246, 1989; Pat No. 5^83,173; Fearon et al., Proc. Natl. Acad. Sci., 
USA 89:7958-7962, 1992; Chicn et al., Proc. Nafl. Acad. Sci. USA 88:9578-9582, 1991; 
Young, BioL Rq>rod 58:302-31 1(1998), each of which is incorporated herein by 
reference). Similar methods are used to gen^e fiie second hybrid protein, which can 

10 comprise a plurality of polypeptides to be tested for the ability to interact with the 

polypeptide, or domam thereof of tibe first hybrid protein. Such methods similarly can be 
used to construct directionally linked nucleic acid molecules encoding fiision protein 
useful for a modified form of a two hybrid assay such as the reverse two hybrid assay 
(Leaima and Hannink, Nucl. Acids Res. 24:3341-3347, 1996, which is mcorporated 

15 h^ein by reference), the rq)ressed transactivator system (U.S. Pat. No. 5,885,779, which 
is incorporated herein by reference), the protein recruitment system (U.S. Pat 
No. 5,776,689, which is incorporated herein by reference), and the like. 

As disclosed herein, a second ds nucleic acid molecule can be one of a plurality of 
nucleic acid molecules, for example, a cDNA library, a combinatmial Hhrary of nucleic 

20 add molecules, or a population of variegated nucleic acid molecules. As such, the 
methods of the invention are particularly usefiil for g^erating recombinant 
polynucleotides encoding chimeric polypq>tides for perfi>nning a hi^ throughput two 
hybrid assay for identifying protein-protein interactions that occur among populations of 
polypeptides (see U.S. Pat No. 6,057,101 and U.S. Pat No. 6,083,693, each of ^ch is 

25 mcorporated herein by reference). In such a metiiod, each of tiie hybrid proteins of the 
two hybrid assay is generated usmg a different one of two populations (pluralities) of 
nucleic acid molecules encoding polypeptides, each plurality having a complexity of fitom 
a few related but diff^^t nucleic acid molecules to as hi^ as tens of thousands of such 
molecules. By performing a mettiod of the invention, for example, using a PGR primer 

30 pair to amplify each nucleic acid molecule in a plurality, directionally linked recombinant 
polynucleotides encoding a population of chimeric bait polypeptides and a population of 
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chimeric prey polyp^tides readily can be generated. Such populations are generated by 
contacting the amplified pluralities of nucleic acid molecules, each of which comprises an 
appropriate end, with a topoisomerase and a nucleic acid molecule which contams a 
topoisomerase recognition site at or near its ends and encodes a transcription activation 
S domain or a DNA binding domain. 

A first ds nucleic acid molecule useful in a method of the invention also can 
encode a ribonucleic acid (RNA) molecule, which can fimction, for example, as a 
riboprobe, an antisense nucleic acid molecule, a ribozyme, or a triplexing nucleic acid 
molecule, or can be used in an m vitro translation reaction, and the second ds nucleic acid 

10 molecule can mcode an expression control element usefiil for expressing an RNA &om 
the first nucleic acid molecule. For sample, where it is desired to produce a laxge 
amount of RNA, a second ds nucleic add molecule component for peifoiming a method 
of the invention can comprise an RNA polymerase promoter such as a T7, T3 or SP6 
RNA polymerase promoter. Whero the RNA molecule is to be expressed in a cell, for 

IS exanq)le, an antisoose molecule to be expressed in a mammalian cell, the second (or 

other) ds nucleic acid molecule can include a promoter that is active in a mammalian cell, 
particularly a tissue specific promoter, which is active only in a target cell. Furthermore, 
whero the RNA molecule is to be translated, for example, in a coiq>led in vitro 
transoiption/translation reaction, the first nucleic acid molecule or second (or ottier) 

20 nucleic add molecule can contain appropriate translational expression control elements. 

A directionally or non-directionally linked recombinant nucldc acid molecule 
gen^ed according to a method of the invention can be used for various purposes for 
which recombinant vectors containing a directionally or non-directionally inserted nucldc 
add molecule aro gmmiBiy used. Thus, the directionally or non-directionally linked 

25 nucldc acid molecule can be used, for exanq)le, fi>r espressmg a polypqitide in a cell, for 
diagnosing or treating a pathologic condition, or flie like. For administration to a living 
subject, tile directionally or non-directionally linked recombinant nucldc add molecule 
geoearally is formulated in a pharmaceutical conqK>siti<m sdlable fiir administration to the 
subject Tims, the invention provides phaniiaceuticalconqK>8iticms 001^^ 

30 directionally or non-directionally linked recombinant nucldc add molecule derated 
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according to a method of the invention and expression products of this nucleic acid 
molecule. As such, the nucleic add molecule is usefiil as a medicameaoit for tieatmg a 
subject suffering fiom a pathological condition. 

Phannaceutically acceptable carriers are well known in the art and include, for 
5 example, aqueous solutions such as water or physiologically buffered saline or other 
solvents or vehicles such as glycols, glycerol, oils such as olive oil or injectable organic 
esters. A pharmaceutically accq^table carrier can contain physiologically acceptable 
compounds that act, for example, to stabilize or to increase the absorption of the 
conjugate. Such physiologically acceptable compounds include, for example, 

10 catfoohydrates, such as glucose, sucrose or dextrans, antioxidants, such as ascoibic acid or 
glutathione, chelating agents, low molecular weight proteins or oth^ stabilizers or 
exdpients. One skilled in the art would know that the choice of a phannaceutically 
acceptable cairior, including a physiologically accq>table coxqpoimd, depends, for 
example, on the route of administration of the conq)osition, which can be, for example, 

IS orally or parmtorally such as intravenously, and by injection, uitubation, or other such 
method known in the art. The pharmaceutical composition also can contain a second 
reagent such as a diagnostic reagent, nutritional substance, toxin, or tiierapeutic agent, for 
exao^le, a cancer chemother^eutic agmt 

The directionally linked recombinant nucleic add molecule can be incorporated 
20 wiflun an encapsulating material such as into an oil-in-water emulsion, a microemulsion, 
micelle, mixed micelle, liposome, mioiosphere or other polymer matrix (see, for exairqple, 
Gregoriadis, Liposome Technology, VoL 1 (CRC Press, Boca Raton, FL 1984); Fraley, et 
al.. Trends Biochem. Sci, 6:77 (1981), each of which is incorporated herein by 
reference). liposomes, for example, which consist of phospholipids or oUbsx Upids, are 
25 nontoxic, physiologically accqitable and metabolizable carriers tibat are relatively sinqsle 
to make and admmister. ^^SteaUfa" liposomes (see» for example, U.S. Pat Nos. 5,882,679; 
5395,619; and 5,225,212, each of which is incorporated herein by reference) are an 
example of such »c£q[)sulating materials particularly useful far preparing a 
pharmaceutical composition, and other ''masked'' liposomes similarly can be used, such 
3 0 lq)osomes lending the time that a nucleic acid molecule r^nains m the drculation. 
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Catiomc liposomes, for example, also can be modified with specific receptois or ligands 
(Morishita et al., J. Clin. Invest, 91:2580-2585 (1993), which is incoiporated heran by 
reference). The nucleic acid molecule also can be introduced into a cell by complexing it 
with an adenovinis-polylysme conqilex (see, for example, Michael et aL, J. Biol. Ch&n. 
5 268:6866-6869 (1993), which is incorporated herein by refeimce). Such compositions 
can be particulaiiy usefiil for introducing a nucleic add molecule into a cell in vivo or in 
vitro, including ex vfvo, wherein the cell containing the nucleic add molecule is 
administered back to the subject (see U.S. Pat No. 5,399,346, which is incorporated 
hGtdn by reference). A nucldc acid molecule generated according to a method of the 
10 invention also can be introduced into a cell using a biolistic method (see, for example, 
S^es and Johnston, supra, 1999). 

As disclosed herein, a directionally linked nucldc add molecule generated 
according to a method of the invention contains a nick, which can be resolved, for 
example, by contacting the nicked recombinant nucleic acid molecule witib a ligase. Such 

15 a duecdonally linked recombinant nucldc add molecule that is covalently linked in both 
strands can be used as a tenq>late for an anq>lification reaction such as PGR. Assuch,a 
large amount of the construct can be generated. Furthemiore, an anq>lification reaction 
can provide an in vitro selection method for obtaining only a desired product, without 
obtaining partial reaction products. For example, a method of the invmtion can be used 

20 to generate a directionally linked recombinant nucldc acid molecule comprising, 

operatively linked in a 5' to 3' orimtation, a first ds nucldc acid molecule comprismg a 
promoter, a second ds nucldc add molecule comprising a coding region, and a third 
ds nucldc add molecule comprising a polyadenyiation signal, wherein the nicks in the 
gen^ed recombinant nucldc add molecule are ligated. 

25 By selecting a PGR primer pair including a first prim^ complementary to a 

nucleotide sequmce iQ)stream of the promoter sequence, and a second pruner 
conq)lementary to a nucleotide sequence downstream of tiie polyaden^ation signal, a 
fimctional amplification product conqmshig the promoter, coding r^on and 
polyadmylation signal can be generated, b contrast, partial reaction products fliat lade 

30 dther the first ds nucldc add molecule or fliird ds nucleotide are not anq)lified because 
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either the fiist or second primer, respectively^ will not hybridize to the partial product In 
addition, a construct lacking the second ds nucleic acid molecule would not be generated 
due to the lack of complementarity of the 5' overhanging sequences of the iBrst and third 
ds nucleic acid molecules. As sudi, the invention provides, in part, a means to obtain a 
5 desired functional, directionally linked recombinant nucleic acid molecule. 

The use of an amplification reaction such as PGR in such a manner further 
provides a means to screen a large nimiber of nucleic acid molecules generated according 
to a method of the invention in order to identify constructs of interest Since methods for 
utilizing PGR in automated high throughput analyses are routine and well known, it will 

10 be recognized that the methods of the invention can be readily adapted to use in a hi 

thioug^put system. Using such a system, a large numb^ of constructs can be screened in 
parallel, and partial or incomplete reaction products can be identified and disposed o^ 
thereby prevCTting a waste of time and e3q)ense that would otherwise be required to 
characterize the constructs or examine tiie functionality of the constructs in fiirther 

IS expoim^its. 

Recombinant nucleic acid molecules g^erated by a method of the invention 
wherein the first ds nucleic acid molecule contains a first topoisomerase but not a second 
topoisomerase or topoisomerase binding site, are generally linear, whereas, ia other 
aspects, the methods of the invention can generate circular recombinant nucleic acid 

20 molecules. However, a directionally linked recombinant nucleic acid molecule that is 
genemted as a linear molecule can be circularized, for example, where it is to be used as a 
vector, hi addition, a linear directionally linked recombinant nucleic acid molecule 
generated by a method of tiie invention can be cloned into a vector, which can b e a 
plasmid vector or a viral vector such as a bacteriophage, baculovirus, retrovirus, 

25 lentivirus, adenovirus. Vaccinia virus, s^nliki forest virus and adrao-associated virus 
vector, all of which are well known and can be purchased fi:om commercial sources 
Ohvitrogen Corp., La JoUa CA; Prompga, Madison Wl^ Stratagen^ La JoUa CA; 
GIBCOyBRL, Gaithersburg MD). 



30 



The methods of the invention also can be used to detectably label a nucleic acid 
molecule with a chemical or small organic or inorganic moiety sudi tliat the nucleic acid 
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molecule is useful as a probe. For example, a first ds nucleic add molecule, which has a 
topoisom«:ase recognition site, or cleavage product thereof at a 3' tenninus, can have 
bound thereto a detectable moiety such as a biotin, which can be detected using avidin or 
strq)tavidin, a fluorescent compound (e,g., Cy3, Cy5, Fam, fluorescein, or ihodamine)» a 
S radionuclide (e.g., sulfur-BS, technidum-99, phosphorus-32, or tritium), a paramagnetic 
spin label (e.g., carbon-13), a chemiluminescent compound, an epitope, for example a 
peptide epitope, which can be detected using an antibody that recognizes the epitope, or 
the like, such that, vpon generating a directionally linked double stranded recombinant 
nucleic acid molecule according to a method of the invention, the generated nucleic acid 

10 molecule wiU be labeled. Methods of detectably labeling a nucleic acid molecule witfi 
such moieties are well known in tiie art (see, for example, Hermanson, "Bioconjugate 
Techniques" (Academic Press 1996), which is mcoiporated herein by reference). It 
should be recognized that such elements as disclosed herein or otherwise known in the 
art, including nucldc acid molecules encoding cell compartmentalization domains, or 

15 detectable labels or tags, or comprising transcriptional or translation expression control 
elements can be usefiil components of a kit as disclosed herein. 

A method of the invention, in which a first ds nucleic add molecule with a first 
topoisomerase, but not a second topoisomerase or topoisomerase recogmtion site, is used 
can be particularly usefiil for generating an e?q)ressible recombinant nucldc acid 

20 molecule fliat can be inserted in a site specific manner into a taigrtDNA sequence. The 
target DNA sequence can be any DNA sequence, particularly a genomic DNA sequence, 
and preferably a gene for which some or all of the nucleotide sequence is known. The 
method can be p orformed utilizmg a first ds nucleic acid molecule, which has a first end 
and a second end and encodes a polypq)tide, for sample, a selectable marker, wherein 

25 the first ds nucldc acid molecule comprises a complementary sequ^ce at both ends; and 
directionally linking the first ds nucleic acid molecule to first and second PGR 
amplification products, which are generated fiom sequences upstream and downstream of 
the site at which the constmct is to be inserted, wherein each amplification products each 
contain a topoisom^ase recognition site and a 5* target sequence selected based on the 

30 &ctors set forth in the present disclosure. For example, the first and second amplification 
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products have dififer^t 5' taxget sequences such tfaa^ upon cleavage by a topoisomerase, 
each can be linked to a piedetennined end of fhe first ds nucleic acid molecule. 

The first and second amplification products are generated using two sets of PGR 
primer pairs. The two sets of PGR primer pairs are selected such that, in the presence of 
5 an ^propriate polymerase such as Tag polymerase and a template comprising the 

sequences to be amplified, fhe primers amplify portions of a target DNA sequence that are 
upstream of and adjacent to, and downstream of and adjacent to, the site for insertion of 
the selectable marker. In addition, the sets of PGR primer paurs are designed such that the 
amplification products contain a topoisomerase recognition site and, following cleavage 

10 by the site specific topoisomerase, a 5' oveifaanging sequence at the end to be 

duectionally Imked to the selectable maiken As such, the first PGR primer pair includes 
1) a first prim^, i^ch conq)rises, in an orientation fix>m S' to 3*, a nucleotide sequence 
complementary to a 5' con^>lanentary sequence of the end of the selectable marker to 
which the amplification product is to be directionally linked, a nucleotide sequence 

1 5 conq)lementary to a topoisomerase recognition site, and a nucleotide sequence 

complementary to a 3' sequence of a target DNA sequence upstream of flie insertion site; 
and 2) a second primer, which conq)rises a nucleotide sequence of the target genomic 
DNA upstream of the 3' sequence to which the first prim^ is conq)lementary, i.e., 
downstream ofthe insertion site. The second PGR primer pair includes 1) a first primer, 

20 which comprises, fix>m 5' to 3', a nucleotide sequence cGmplemmtary to fhe 

5' complementary sequence of tiie end ofthe selectable marker to which it is to be 
directionally linked, a nucleotide sequence conqilemratary to a topoisomerase recognition 
site, and a nucleotide sequence of a 5' sequence of a target DNA sequence, wherein the 
5' sequence ofthe target genomic DNA is downstream ofthe 3' sequence ofthe target 

25 DNA sequmce to vMch the first primer ofthe first PGR primer pair is complementary; 
and the second primer of the second primer pair comprises a nucleotide sequence 
complementary to a 3' sequence ofthe target DNA sequence that is downstream ofthe 
5' sequence ofthe target genomic DNA contained in the first primer. The skilled artisan 
will recognize that fhe sequences ofthe primer tiiat are complementary to tiie target 

30 genomic DNA are selected based on the sequence of tiie target DNA. 
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Upon contact of the fiist ds nucleic acid molecule comprising the selectable 
maiker, the JBrst and second amplification products (Le., second and tbiid ds nucleic acid 
moleculesX and a topoisomerase (unless the molecules are topoisomecase-duurged), a 
directionally linked recombinant nucleic acid molecule is generated Following ligation 
5 of fbe nicks, tiie generated recombinant nucleic acid molecule can be further amplified, if 
desired, using PGR primers fliat are specific for an upstream and downstream sequence of 
the target genomic DNA» thus ensuring Aat only functional constructs are amplified 
Such a generated directionally linked recombinant nucleic acid molecule is usefiil for 
performing homologous recombination in a genome, for example, to knock-out the 
10 function of a gene in a cell, or to confer a novel phenotype on the cell containing the 

generated recombinant nucleic acid molecule. The method can further be used to produce 
a transgenic non-human organism having the generated recombinant nucleic add 
molecule stably maintained in its genome. 

A method of the mvention involving a first ds nucleic add having a topoisomerase 
IS or topoisomerase-recogmlion site, for example, at a first end but not the second end, also 
can be usefiil for covalently linkmg an adq>ter or Unkec sequence to one or bodi ends of a 
second ds nucldc add molecule of interest, includiug to each of a plurality of second (or 
other) ds nucleic acid molecules. For example, where it is desired to put linkers on both 
ends of a first ds nucleic acid molecule, the method can be performed by contacting a 
20 topoisomerase with a first ds nucleic acid molecule, which has a 5' complementary 

sequence at or near each 5' terminus that is complementary to an overhang sequence on a 
5' t^minus of each of tiie second and third ds nucleic add molecules; and a second ds 
nucldc acid molecule and a third ds nucldc add molecule, each of which includes a 
tx^isomerase recognition site at the ^ropriate 3' teiminus and a S' overhang sequence 
25 at or near the end containing the topoisom«:ase recognition site. An ^ropriate terminus 
is the tenninus to which the linker is to be directionally linked to the first ds nucldc acid 
molecule. In performing such a method, the linker sequences comprising the second and 
at least thhd nucldc acid molecule can be the same or difr<a:ent 

Ameifaod of the invmtion mvolving a first ds nucleic add molecule with a 
30 5' target sequence and a topoisomerase on a first end, can be perfi>nned to dkectionally or 
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non-directionally link a first ds nucleic acid molecule to at least a second ds nucleic add 
molecule. The method typically is used to directionally link the first ds nucleic acid 
molecule and the second ds nucleic acid molecule. However, the method can be used to 
non-diiectionaUy link the first ds nucleic acid molecule and the second ds nucleic acid 
5 molecule in the following embodiments: 1) Where a second nucleotide sequence is 
present at or near tfie 5' tenninus of the second end of Ihe first ds nucleic acid molecule, 
that is cq)able of hybridizing to the compl^entary nucleotide sequence at the second 
end of the second ds nucleic add molecule; and 2) Where a nucleotide sequence is 
present at or near the 5' tenninus of both tilie first end and the second end of the second ds 
10 nucldc acid molecule that is c{q>able of hybridizmg to the 5* ov^ang at the first end of 
the first ds nucldc acid molecule. In fhese embodiments involving non-directional 
linking, the second end of file first ds nucleic acid molecule and the second end of the 
second ds nucleic acid molecule can be either blunt, or include an overhang. 

In CTibodiments involving a linking a third ds nucldc add molecule to the second 
IS ds nucldc acid molecule, the methods can be used to directionally or non-directionaUy 
link the two nucleic add molecules. The method typically is used to directionally link the 
second ds nucldc add molecule and fiie third ds nucleic acid molecule. However, the 
mefiiod can be used to non-directionally link the third ds nucldp acid molecule and die 
second ds nucldc acid molecule in the following embodiments: 1) Where a second 
20 nucleotide sequence is present at or near the S* terminus of the second end of the third ds 
nucleic acid molecule, that is ciqpable of hybridizing to the S' complementary nucleotide 
sequence at the second end of the second ds nucldc acid molecule; and 2) Where a 
nucleotide sequence is present at or near the S' temunos of both the first end and fhe 
second end of the second ds nucleic acid molecule that is capable of hybridizing to the 
25 5* overhang at the first end of the third ds nucldc acid molecule. In these embodiments 
involving non-directional linking, the second end of the third ds nucleic acid molecule 
and the second end of the second ds nucleic acid molecule can be either blunt, or include 
an overhang. 

The presmt invention also provides a conqiosition, wtdch includes a first 
30 ds nucldc acid molecule having a first end and a second end, wherdn the first end has a 
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5' overiiang and a topoisomerase covaleatly bound at the 3' terminus; and a second 
ds nucleic acid molecule having a first blunt end and a second end» wherdn the first blunt 
end has a first 5' nucleotide sequmce, which is complementary to the first 5'-oveihan& 
and a Gist 3' nucleotide sequence complan«tary to the first S* nucleotide sequence. In 
such a composition, the first 5' nucleotide sequence of the first blunt end of the second 
ds nucleic acid molecule can be hybridized to the first 5' ovechang of the fiorst end of the 
first nucleic acid molecule, wherein the first 3* nucleotide sequence of tiie first bhmt md 
of the second ds nucldc acid molecule displaced The first ds nucleic add molecule in 
such a composition can further have a second 5' ovohang at the second end, and the 
second end of the second ds nucleic acid molecule can further include a second 
S* nucleotide sequence, which is complementary to the second 5' ov^hang, and a second 
3' nucleotide sequence conq>Iementary to the second 5' nucleotide sequence. 

The present invention also provides kits, which contain one or more (e.g., 1, 2, 3, 
4, 5, 6, 7, 8, 9, 10, etc*) reagents usefiil for directionally or non-directionally linking ds 
nucleic acid molecules. In one embodiment, a kit of the invention contains a ds nucleic 
acid molecule having a first end and a second end, wherein the first end contains a first 
5' overhang and a first topoisomerase covalentiy bound at the 3' terminus, and the second 
md contains a second topoisomerase covalentiy bound at the 3' terminus and contains a 
second S' overhang, a blunt end, a 3* uridine ovechang, or a 3* thymidine ovedumg, 
wherein the first 5' oveihang is different fiom Ifae second S' overhang. The 
topoisomerases, which can be the same or different, also can be a component of tiie kit 
The ds nucleic acid molecule in tiie kit can, but need not be a vector, and can contain one 
or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, etc.) expr^on control elemmts, as well as 
instructions for using kit componrats. 

A kit of ttie invention also can include a plurality of second ds nucleic add 
molecules, v/hea^em each ds nucleic add molecule in tiie plurality has a first blunt end, 
and ^erdn ttie first blunt end includes a 5 ' nucleotide sequfflce complementary to flie 
first 5* overhang of the first ds nucldc acid molecule. The second ds nucleic add 
molecules in tiie plurality can be a plurality of transcrq)tional regulatory elements. 
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translational regulatory elements, or a combination thereof or can encode a plmality of 
pq)tides such as peptide tags, cell compartmentalization domains, and tibe like. 

A ds nucleic acid molecule component of a kit can be, for example, a linearized 
vector such as a cloning vector or expression vector. If desired, such a kit can contain a 
S plurality of ds nucleic acid molecules, each comprising a diff^ent expression control 
element or other element such as, but not limited to, a sequence encoding a tag or other 
detectable molecule or a cell conq>artmentalization domain. Tlie different elements can 
be different types of a particular expression control elraimt, for exanqile, constitutive or 
inducible promoters or tissue specific promoters, or can be different types of elements 

10 including, for example, transcriptional and transladonal expression control elements, 
epitq^e tags, and the like. Such ds nucleic acid molecules may be topoisomerase- 
activated or can be activated with topoisomerase, and contain 5' overhanging sequences, 
or sequences fliat become 5' ov^iianging sequences aft^ topoisomerase activation. In 
addition, the plurality of ds nuclric acid molecules can have S' overhangjmg sequences 

15 that are unique to a particular expression control element, or.tfaat are common to plurality 
of related expression control clients, for example, to a plurality of different promoter 
elements. The S' overfaang^g sequences of ds nucleic acid molecules can be designed 
such that one or more expression control elements contained on the ds nucldc acid 
molecule can be operatively directionally linked to provide a useful function, for 

20 exanq>le, an elCTient conqnising a Kozak sequence and an element comprising a 

translation start site can have complemratary S ' overhangs such that the elements can be 
qperatively directionally linked according to a method of tiie invention. 

The invention further provides kits for linkmg nucleic add molecules usmg 
methods described h^izL Thus, kits of the invention may conqirise one or more 

25 coinponents for perfiarmingmefliods described hmuL In particular mibodimmts, kits of 
the invention may comprise one or more conq)onent selected ficom the group consisting of 
instructions for use of kits conqionents, one or more buffers, one or more nucldc acid 
molecules (e.g., one or more nucldc add molecules having a 5' oveifaang, a 3* ovediang, 
a 5' overhang and a 3' oveihang, two 3' overhangs, two 5' oveifaangs, etc.), one or more 

30 topoisomerase, one or more ligase, one or more recombinase^ one or more adq>ter linker 
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for preparing molecules having a 5' oveifaang and/or a 3' ovetiiang, and/or one or more 
containers in which to perform methods of the invention. 

The following examples are intmded to illustrate but not limit the invention. 

EXAMPLE 1 

S Iq a preferred embodiment of the present invention, a topoisomerase-charged ds 

nucleic acid molecule is made by first obtaining a commercially available cloning vector. 
One such vector is pUniA^5-His version A, (Invitrogen Corp, Carisbad, CA), a circular 
siQiercoiled vector that contains uniquely designed elements. Tliese elements include a 
BGH polyadenylation sequence to inorease mKNA stability in eukaiyotic hosts, a 

10 T7 transcription teraiination region, an R6K7 DNA replication origm and a kanamycin 
resistance gene and promoter for antibiotic resistance selection. Additionally, 
pUniA^S-His version A contains a multiple cloning site, which is a synthetic DNA 
sequence encoding a series of restriction endonuclease recognition dtes. These sites are 
engmeered for cloning of DNA into a vector at a specific position. Also within ttie 

1 5 vector's multiple cloning site is a loxP site inserted S' to the endonuclease recognition 
sites thereby facilitating Ore recombinase-mediated fusion into a variety of other 
expression vectors, (Echo™ Qoning System, Invitrogen Corp., Carlsbad, CA). An 
optional C-terminal V5 qpitope tag is present for easy detection of ©q)ressed fusion 
protems using an Anti-V5 Antibody. An optional C-terminus poljdiistidine (6xHis) tag is 

20 also presmt to enable rapid purification and detection of expressed proteins. A bacterial 
ribosomal binding site downstream &om the loxP site makes transmption initiation in 
E. coli possible. Though this combination of elements is specific for pUniA^5-His 
version A cloning vector, many similar cloning and expression vectors are commCTcially 
available or can be assembled flrom sequences and by methods well known in the art 

2S pUni/yS-His version A is a 22kb double stranded plasmid (see Figures 3 and 5). 

Construction of a topoisomerase I charge cloning vector fiom pUniA^S-His 
version A is accomplished by endonuclease digestion of the vector, followed by 
complementary annealing of synthetic oligonucleotides and site-specific cleavage of the 
heteroduplex by Vaccinia topoisom0:ase L SacI and EcoRI are two of the many 
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restriction endonuclease sites present witiiin the multiple cloning site of pUniA^S-His 
v^ion A, (See FIG. 3). Digestion of pUniA^S-His version A with the corresponding 
restriction enzymes. Sad and EcoRI will leave cohesive ends on the vector (S'-AGCT- 
3* and 5'-AATT-3', Figure 6). These enzymes are readily available from numerous 
5 VCTdors including New Engjand Biolabs, (Beverly, MA, Catalogue Nos. R0156S, Sacl 
and ROIOIS, EcoRS). The digested pUniA^5-His version A is easily separated from the 
digested fiagments using isopropanol precipitation. These and other mettiods for 
digesting and isolating DNA are well known to those skilled in the art, (Sambrook et al., 
(1989) Molecular Cloning, A Laboratory Manual, Second edition. Cold Spring Harbor 
10 Laboratory Press; pages 5.28-5.32.) 

The purified, digested vector is fh&x incubated with two spedfic oligonucleotide 
adapters and T4 DNA ligase. The adapters are oligonucleotide duplexes containing ends 
that are compatible with the Sacl and EcoRI ends of the vector. QneofskiUintfaeart 
will readily q»preciate that other adapter oligonucleotides with appropriate sequences can 
15 be made for other vectors having diffomt restriction sites. Following mcubation with 
T4 DNA ligase, the vector containing tiie ligated adapters is purified usmg isopropanol. 

The ad^ter duplex that results from the annealing of TOPO Dl and TOPO D2 
has a single-stranded EcoRI overhang at one end and a 12 nucleotide single stranded 
overhang at the otber end. 

20 The first adapter oligonucleotide, (TOPO Dl), has complementation to the EcoRI 

cohesive end, 3 -TTAA-5\ Furthermore, TOPO Dl has an additional 24 bp mcluding the 
topoisoma:ase consensus pent^yrimidine element 5-CCCTT located 16 bp upstream of 
the 3* end. The ranaining sequence and size of TOPO Dl adapter oligo is variable, and 
can be modified to fit a researcher's particular needs. In the current embodiment 

25 5'-AAnGATCCCTICACCC3ACATAGTACAG-3' (SEQ ID N0:5) is the fiiU sequence 
oftheadq)teru8ed. 

The second adapter oligonucleotide, (TOPO D2), must have fiill complementation 
to TOPO Dl . TOPO D2 conq)lement5 direcfly 5' of ttie EcoRI cohesive flap, extending 
the bottom strand of the linearized vector. Additionally, TOPO D2 contains tiie sequence 
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3'-GTGG, which is fhe target sequence, and single-stranded overhang after topoisomerase 
cleavage, for directional cloning. In this mibodiment, fhe single stranded ovediang was 
chosen to complement the Kozak sequence known to help expression of ORPs in 
eukaryotic cells by increasing the effidmcy of ribosome binding on the mRNA» however, 
S sequence and length are highly variable to meet &e specific needs of individual users. 
The complete sequence of TOPO D2 is 3'-CTAGGGAACTGG-5' (SEQ ID N0:6). 

Similar to above, the adapter duplex that results 6om the annealing of 
oligonucleotides TOPO D4 and TOPO D5 has a single-stranded SacI overhang at one 
end, and a 12 nucleotide single-stranded overhang at fhe other end 

10 The third ad^ter oligonucleotide, (TOPO D5), has conq)lementation to the SacI 

cohesive end, 3*-TCGA-5'. Similar to TOPO Dl, TOPO D5 has additional bases creating 
a single stranded overhang. The Iragfh and sequence can vary based on tiie needs of the 
user. In the current embodiment TOPO D5*s sequence is 5'-AAGGGCGAGCT-3' (SEQ 
IDNO:7). 

15 The fourth ad^ter oligonucleotide, (TOPO D4), has fiill conq>lementation to 

TOPO D5, and complements dhecfly 5* of tiie SacI cohesive fl^ extrading the top strand 
of tiie linearized vector. TOPO D4 also contains the topoisomerase consensus sequence 
5'-CCCTT. The remaining sequence and size of TOPO D4 adapter oligo is variable and 
can be modified to fit a researcher's particular needs. In fhe current embodiment^ fhe 

20 sequence of TOPO D4 is 3'-GACATGATACAGTTCCCGC-5' (SEQ ID N0:8), which 
mcludes an additional 12 bp single stranded ov^ang. 

These ad^t^ oligonucleotides can be chemically syatiiesized using any of 
num^us techniques, including the phosphoramadite metiiod (Caruthers et aL, Meth. 
Bizvmol. 154:287-3 13» 1987). This and other methods for fhe chemical synthesis of 
25 ofigos iare well known to those of ordinary skill in fhe art 

Ckunplemmtary annealing of fhe purified digested vector and the adq>ter 
oligonucleotides is done by incubation of die DNA m fhe presence of T4 DNA ligase. 
Typical ligation reactions are permed by incubation of a cloning vector with suitable 
DNA fi^igments in tbe presence oftigase and an sq^propriate reaction Buffers for 
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ligation reactions should contain ATP to provide energy to for the reaction, as well as, 
reducing reagents like dithiothreitol and pH stabilizers like Tris-HCl. The ratio of 
concentrations for the cloning vector and the DNA fiagments are depend&A on each 
individual reaction, and formulae for their determination are abundant in the literature, 
5 (See e.g.. Protocols and Applications Guide (1991), Promega Corporation, Madison, WI, 
p.45). T4 ligase will catalyze the formation of a phosphodiester bond between adjacent 
S'-phosphates and 3 -hydroxyl t^mini during the incubation. Cohesive end ligation can 
generally be acconq)li^ed in 30 minutes at 12-15° C, while bhmt csid ligation requires 
4-16 hours at room ten^)erature, (Ausubel et aL, (1992) Second Edition; Short Protocols 

10 in Molecular Biology^ John Wiley & Sons, Inc., New York, NY, pages 3. 14-3.37), 
however parameter range varies for each experiment Id the current embodimrat, 
purified, digested pUuA^S-His vearsion A and the adq>ter oligos were incubated in the 
presence of T4 ligase and a suitable buffer for sbcteen hours at 12.5 *'C. The resulting 
linearized and ad^ted vector conq>rises the purified cloxiing vector attached to the 

15 adq)ter oligonucleotides through base pair complementation and T4 Ugase-catalyzed, 
phosphodiestOT bonds (see Figure 7). 

EfScient modification of the adapted vector with topoisomerase requires the 
addition of an annealmg oligo to gmerate double stranded DNA on TOPO Dl's and 
TOPO I>4's single stranded overiiangs. Vaccmia topoisomerase I initially binds non- 
20 covalendy to double stranded DNA. The ens^efliendifiuses along ttie duplex until 
locating and covalently attaching to the consensus pentapyrimidine sequence 5'-CCCTT, 
formii](g tiie topoisom«:ase adq>ted complex, (See Shuman et aL, U.S. Pat 
No. 5,766,891). Modification of the adapted vector takes place in the absence of DNA 
ligase to prevoit the formation of phosphodiester bonds between tiie adapted vector and 
25 the aimealing oli^» since phosphodiester bonds in the non-sdssile strand will prev^ the 
dissociation of the leavmg group upon cleavage, (Figures 8 and 9). 

The annealing oligonucleotide, (TOPO DS), must have coooplementation to the 
single stranded DNA overhangs of TOPO Dl and TOPO D4. In tiie current embodiment 
the ovetfaangs botii share the following sequence, 5'-GACATAGTACA&3' (SBQ ID 
30 N0:9). Therefixre, TOPO D3 has the following sequence. 
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3'-CTGTATCATGTCAAC-5' (SEQ ID NO:10), winch comprises fiiU complemeatation 
to the adi^ter oligos' single stranded ovetfaang and an additional 3 bp overhang, 
3'-AAC-5'. 

Incubation of the adapted vector with the annealing oligo in the presence of 
5 topoisomerase will create double stranded DNA to which topoisomerase can non- 

covalently bmd, (Figure 10). Bound topoisomerase will search the double stranded DNA 
by a fecilitated difiSision mechanism, until the 5 -CCCTT recognition motif is located. 
Cleavage of the phosphodiester backbone of the scissile strand 3' of the motif is catalyzed 
via a nucleophilic attack on the 3* phosphorous atom of the preferred oligonucleotide 

10 cleavage sequence 5'-CCCTT>lr resulting in covalent attachment of the DNA to the 

enzyme by a 3'-phosphotyn)syl linkage, (See Shuman et aL, (1989) Proc. Natl Acad. Sci, 
USA. 86, 9793-9796). Cleavage of the scissile strand oreates a double stranded leaving 
ffoxsp con5)risnig the 3' end ad^ta- oligo, downstream fiom the 5'-CCCTT motii^ and the 
annealing oligo TOPO D3. Although the leavmg groiqp can religate to the topoisomerase- 

IS modified end of the vector via 5' hydroxyl-mediated attack of the phosphotyrosfyl linkage, 
this reaction is disfavored wbsa the leaving group is no longer covalentiy attached to the 
vector. The addition of T4 polynucleotide kmase and ATP to the cleavage^ligation 
reaction further shifts the equilibrium toward the accumulation of trapped topoisoma:ase 
since the kinase can phosphorylate the 5' hydroxyl of tiie leaving gpnxp to prevent the 

20 rejoining fix)m takmg place, (Ausubel et al., (1992) Second Edition; Short Protocols in 
Molecular Biology, John Wiley & Sons, Inc., New York, NY, pp. 3.14-3,30). The 
resulting linearized vector conqirises a blunt md fiom the TOPO D4/D3 leavmg group 
and a single stranded overhang bearing end fit>m the TOPO D1/D3 leaving group, (Figure 
1 1). Both of the linearized cloning vector's ends are charged with topoisomerase, 

25 enabling fast, efiScient and directional topoisomerase mediated insertion of an acceptor 
molecule. 

Altfaougli the above example details tiie modification of pUniA^S-His version A to 
fi>nn the topoisom^rase-modified directional cloning vector, a person of ordinary sidll in 
the art will appreciate how to qiply tiiese methods to any plasmid, cosmid, vurus, or otiier 
30 DNA. It diodd also be noted that tins example demonstnttes a vector containing a 
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5' single stranded overhang comprising flie sequence 5'-GGTG-3', however the design of 
adapter duplexes and annealing oligonucleotides would allow one of skill in the art to 
custom design ovediangg of any sequence or length at one or both ends of a given vector. 

Specifically, any plasmid, cosmid, virus or other DNA can be modified to possess 
5 a single stranded overhang of any convenient sequence and length. These are the basic 
steps: the vector is first subjected to a treatment tiiat is known to Imearize the DNA. 
Common procedures include, but are not limited to, restriction digestion and treatment 
with topoisomerase H Following linearization, a custom single stranded ove±ang is 
added In the above example, complementary oligonucleotides are added to the sticky 
10 ends of a restriction digestion giving the desired single stranded oveifaang, however single 
stranded overhang forming oligonucleotides can be added by T4 blunt end ligation, as 
well. The single stranded overhang sequence is exposed by a topoisomerase I mediated, 
smgle strand nicking. la turn, this singile stranded overhang can be used to diiectionally 
insert a PGR product comprising one or more complimentary nucleotide sequences. 

1 5 Likewise, topoisomarase modification can be applied to any double-stranded 

plasmid, cosmid, virus or other piece of DNA. Methods for flie attachment of 
topoisomerase I to double stranded DNA are well known in the art, (See Shuman et al., 
U.S. Pat No. 5,766,891). The strategic placement of topoisomerase on to a piece of 
double stranded DNA is determined by the incorporation of a topoisomerase I consensus 

20 sequence, (See Shuman et al., U.S. Pat. No. 5,766,891). The topoisomerase I will bmd 
the double stranded DNA, nick the scissile strand thus revealing the pred^mnined 
single-stranded overhang sequence, and ligate the mcoming PGR product in the correct, 
single stranded overhang mediated otiratation. 

EXAMPLE 2 

25 As an example of the application of the present invention to another plasmid, 

pCR® 2.1, CPigutes 4 and 12), was modified to create a topoisomerase I adq>ted vector 
with a custom single stranded sequence. 

The pCSl® 2.1 plasmid is 3.9 kbT/A cloning vector. Wilhm the sequence of this 
vector are many uniquely designed elemmts. These el^ents include an fl origin, a 
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ColEl origin, a kanamycin resistance gme, an an4)iciUm resistance gene, a ZocZ-al^ba 
fragment and a multiple cloning sequrace located within the LacZ-alphB fragment 
allowing for blue-white selection of recombinant plasmids. The multq>le cloning 
sequence, (FIG. 4) of the pCR® 2.1 plasmid contains; numerous restriction sites, 
5 uxcluding but not limited to, HindlH, Spel and BcoRS; M13 forwaid and reverse primers 
and a T7 RNA polymerase promoter. 

Construction of flie topoisomerase I charged vector possessing a custom single 
stranded sequence consists of endonuclease digestion followed by complementary 
annealing of synthetic oligonucleotides and the site specific cleavage of the heterodiq)lex 

10 by Vaccinia topoisomerase L Digestion of the pCR^ 2. 1 plasmid wifli the restriction 
enzymes Hindlll, Spel and EcoRI leaves Hindm and EcoRI cohesive mds on the vector 
(Figure 13). Tlie dissociated fi:agmmt of pCR® 2.1 downstream from the BJndm 
cleavage site is fioiher cleaved with Spel in order to reduce its size. By reducing the size 
of the fragment, the digested vector is easily purified away from the smaller digested 

IS pieces by isopropanolprecq>itation. These enzymes are readily available fix)m numerous 
vendors including New England Biolabs, (Beverly, MA, Catalogue Nos.; ROI04S, 
Hindm; R0133S, Spel; ROIOIS, EcoRI). Methods for the digestion and the isolation of 
DNA are well known to those skilled m the art, (Sambrook et aL, supra, 1989). 

The purified digested vector is incubated with four ads^ter oligonucleotides and 
20 T4DNAIigase. These adapter oligonucleotides are designed to have compl^entation to 
dther the Hindm cohesive end, the EcoRI cohesive end, or to each otiier. Following 
incubation with T4 DNA ligase (he ad^ted vector is purified usmg isopropanoL 

The first adapter oligonucleotide, (TOPO H), has complementation to the Hindm 
cohesive end, 3'-TCGA-5*. Furthermore, TOPO H has an additional 24 bp including the 
25 topoisomerase consensus pents^yrimidine element 5 -CCCTT located 19 bp upstream of 
the 3* end The remaining sequence and size of TOPO H ad^ter oligo is variable, and 
can be modified to fit a researcher's particular needs. In tiie current embodiment 
5'-AGCTCG£CerTATTCCGATAGTG-3' (SEQ ID NO: 1 1) is flie fiill sequence of the 
adq>terused 
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The second ad^^ter oligDnucleotide, CTOPO 16), must have fiiU complementatioQ 
loTOPOH. TOP016complGmeats6ke(^^ 

the bottom sband of the linearized vector. Additionally, TOPO 16 contains tiie sequence 
3'-TAAG, which is the chosen single stranded sequence for durectional cloning. The 
complete sequence of TOPO 16 is 3'-GCGGGAATAAfi-5', (SEQ ID NO:12). 

The third adapter oligonucleotide, (TOPO 1), has complementation to the EcoRI 
cohesive end, 3'-TTAA-5'. Similar to TOPO H, TOPO 1 has additional bases containmg 
the topoisomerase I consensus sequmce CCCTT located 12 bp upstream of the 3' end. 
The length and sequence of TOPO 1 can vary based on the needs of the user, hi the 
current embodhnent TOPO I's sequence is S'-AATTCGCCCTTATTCCGATAGTG- 
3'(SEQIDNO:13). 

The fourfli adapter oligonucleotide, (TOPO 2), has fiill complementatibn to 
TOPO 1, and complements directly 5* of the EcoRI cohesive end extending the top strand 
of the linearized vector. In the current embodiment, the sequence of TOPO 2 is 
3'-GCGGGAA.5*. 

Complementary aimealing of the purified digested vector and the ad^ter 
oligonucleotides is done by mcubation of the DNA in the presence of T4 DNA ligase. 
T4 Ligase will catalyze the formation of a phosphodiester bond between adjacent 
5 -phosphates and 3 -hydroxyl termini during the incubation. In the current embodiment, 
purified, digested pCR® 2. 1 and the ad^tCT oligos were incubated m the presence of 
T4 ligase and a suitable buffer for sixteen hours at 12.5** C. The resulting linearized and 
adapted vector comprises the purified cloning vector attached to the ad^ter 
oligonucleotides through base pair complementation and T4 ligase-catalyzed, 
phosphodiester bonds (FIG. 13). Ligation techniques are abimdant in the literature, (see 
Ausubel et al., (1992) Second Edition; Short Protocols in Molecular Biology, John Wiley 
& Sons, lac. New York, NY, pp. 3,14-3.37). 

Charging of the adapted vector with topoisomerase requires the addition of 
annealing oligonucleotides to generate double stranded DNA on TOPO ITs and TOPO Ts 
single stranded ovoiiangs. CSiaigingofthe adapted vector takes place in the absence of 
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DNA ligase to pievent the foimation of phosphodiester bonds between the adapted vector 
and the annealing oligo, since phosphodiester bonds in the non-scissile strand will prevent 
the dissociation of the leaving group upon cleavage (see Figure 9). 

The annealing oligonucleotide, (TOPO 17), must have complementation to the 
5 single stranded DNA overhang of TOPO H. In the current embodiment tfie overhang has 
flie following sequence, 5'-CGATAGTG-3'. Therefore, TOPO 17 has flie following 
sequence, 3'-GCTATCAC-5', which conq)rises full complementation to the smgle 
stranded overhang of the ad^ter oligonucleotides. 

The annealing oligonucleotide, (TOPO 3), must have complementation to the 
10 single stranded DNA overhang of TOPO 1. In the current embodiment the overhang has 
the following sequence, 3'-GTGATAGCCTTA-5' (SEQ ID N0:14). Therefore, TOPO 3 
has the following sequence, 5'-CAACACTATCGGAAT-3' (SEQ ID N0:15), which 
comprises full complementation to the ad^ter oligonucleotide's single stranded overhang 
and an additional 3 bp overhang, S*-CAA-3*. 

1 5 Mcubation of the adapted vector with the annealing oligo in the presence of 

topoisomerase will create double stranded DNA to which topoisomCTase can non- 
covalently bind, (FIG. 14). Bound topoisomerase wiU search the double stranded DNA by 
a fecilitated diffusion mechanism, until the 5 -CCCTT recognition motif is located. 
Cleavage of the phosphodiestCT backbone of the scissile strand 3' of the motif will result 

20 in the covalent attachmrat of the DNA to the enzyme by a 3'-phosphotyrosyl linkage, 
(Shuman et al., Proc. Natl. Acad Sci. aSA. 86:9793-9796, 1989). Qeavage of the 
sdssile strand creates a double stranded leaving group conqmsing the 3* cad die adapter 
oligos, downstream fix>m the 5'-CCCTT motif, and the complemmtary annealing 
oligonucleotide. The leaving group can religate to the topoisomonse ad^ted vector 

25 throu^ its 5' hydroxyl's attack of Ifae phosphotyrosyl linkage, also catalyzed by 

topoisomerase. Addition ofT4 polynucleotide kinase to the equilibrium reaction prevents 
the back reaction via the kinase-mediated phosphorylation of tiie leaving group's 
5' hydroxyl, (Aosubel et al., (1992) Second Bditioiy Short Protocols in Molecular 
Biology, John Wil^ & Sons, Inc., New York, NY, pp. 3.14-3.30). The resulting 

30 Imeaiized vector comprises a blunt odfiom the TOPO 1/3 leaving group and a singile 
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stranded sequence md fipom the TOPO H/17 leaving group, (Figure 15). Botb of flie 
linearized cloning vector's mis are charged with tapoisoma:ase» enabling fiust, efBcient 
and directional topoisomerase mediated insertion of an accq>tor molecule. 

Directional cloning according to the invention. 

5 This invention also provides a method for directional cloning of DNA. Ihtiie 

following example, die topoisomorase-charged ds nucleic acid vector according to the 
present invention constructed from pUniA^5-His version A was used for the directional 
insertion of OKFs fiom the GeneStonn™ Expression Ready Qones, (InvitrogCTi Corp., 
Carlsbad, CA). The modified pUni vector was selected for the cloning of these ORFs 

10 because the added target sequence, which becomes a single strand overhang \xpon 
topoisomerase cleavage of the vector, has homology to the Kozak sequence known to 
enhance ORF expression. Note, however, that, as before, any plasmid, cosmid, virus or 
other DNA could be modified to possess the necessary single stranded sequence. 
Likewise, any DNA firagment could be modified to possess a homologous sequence to 

15 any single stranded overhang of a vector. As a point of interest, the sequence of tiie 

single stranded overhang can effect directional cloning efficiencies. For example, single 
stranded overhangs witii low GC content will have lower annealing stabihty, also single 
stranded overhangs that have higji complementation to both ends of a DNA fiagment to 
be cloned will loose the capability to direct these DNA inserts. Thus flie sequence of a 

20 single stranded overiiang should be carefiilly designed to avoid these and sioodlar 
problCTds. 

EXAMPLES 

The present invention is particularly usefiil in the dkectional insertion of PCR 
pnniuctsmto vectors constructed accordiiig to flie presoitinve^^ InthePCR 
25 anq)lification of the deshed ins^ flie PGR primers ace designed so as to conqilement 
identified sequences of the insert(s) that are to be directionally cloned into flie 
topoisomerase<harged ds nucldc acid vector of the present invention. The primer 
designed to bind upstream of the DNAs coding strand is modified with an additional 
conq)i^entary nucleotide sequence on its 5' end. The resulting PGR product will 
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possess a complemeatary sequence allowing single stranded oveifaang mediated 
directional insertion into tbe topoisomerase-^harged ds nucleic acid cloning vector of the 
presra^t invention and subsequent expression of tbe product 

One embodiment comprises introducing to a donor diq>lex DNA substrate a single 
5 stranded overhang site by PGR amplifying the donor diq)lex DNA molecule with the 
5' oligonucleotide primer containing the single stranded ovo-hang. PGR anq;>lification of 
a region of DNA is achieved by designing oligonucleotide primers that complement a 
known area outside of the desired region. In a preferred onbodiment flie primer that has 
homology to the coding strand of tfie double stranded region of DNA will possess an 
10 additional sequence of nucleotides complementary to the single strand oveifaang of the 
topoisomerase-chaiged ds nucleic acid cloning vector of the preset invention. 

Using the present invention in a higji througlq)ut format, we selected eigjity-two 
known ORFs from the GeneStorm™ CTcpression sj^tem, (Invitrogen Corporation, 
Carlsbad, CA) for directional cloning into the topoisomerase-charged ds nucleic acid 

15 vector of the present invention, however, any sequence of DNA can be selected as desired 
by individual users. For each of these ORFs, primers are designed with homology to the 
coding and the non-coding strands. To clone PGR products in a directional fashion into 
the modiJBed pUniA^5-His version A topoisomeraseK?harged ds nucleic acid vector of the 
present invention as described in example 1, one primer of a given pair was modified to 

20 contain primer of a given pair was modified to contain the nucleotide sequence 

complementary to the single strand ovediang contained within the vector Ihthecurrent 
exanq>le, the coding primer contained the added sequence 5'-CAGC-3', which 
complements fiie 'single stranded ovmliangV 3'-GTGG-5*, of the topoisomeras&<;harged 
ds nucleic add clonmg vector of the present invention. PGR amplification of the above 

25 ORFs with their respective primers will produce double stranded DNA fi»gmeats» vMdh 
possess the single strand overhang at their S* mi, O^igure 1 6). We used pfa polymerase in 
our PGR amplification, but it is well-known that PGR reactions can be pa:fi)rmed with 
either a nourthermophilic polymerase such as pfu or with a flioinqphilic polymerase like 
Tag followed by a blunting step to remove the hon-template nucleotide these enzymes 

30 leave at the end of PGR products. 
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In the present example, 0.1 microgram of each primer was combined with 
O.OS microgram of DNA containing an ORF in a PGR reaction mix totaling SO microliters 
total volume. Besides the primexs and vector, the reaction mix also contained water, PGR 
buff^ salts, lOmM dNTPs and 1.25 units ofpfa polymerase. Themial cycling 
5 tCTiperatures were as follows; an initial 94^ C deoaturation; followed by 25 repetitions of 
94^C denaturation, S5^C primer annealing, and 72^ C elongation, each at one minute; and 
Gaded with a 72^C, fiflem minute elongation* However these parametes will vary with 
each DNA fragmeat to be amplified. PGR amplification techniques are well known to 
those sldlled in tiie art, (Ausubel et al., (1992) Second Edition; Short Protocols in 
10 Molecular Biology, John Wiley & Sons, Inc., New York, NY, pp. 15.3-15.4), as are 
techniques for the conversion of 3' oveiliangs to blunt end termini, {Protocols and 
Applications Guide^ Promega Corp.; Madison WI, pp. 43-44, 1989). 

Incubation of the PGR anq[>lified donor duplex DNA containing die 
complementary nucleotide sequence with the modified pUiu/V5-Hi8 version A 

1 5 topoisom«:ase-charged ds nucleic add vector of the present inveation results in the 
dkectional cloning of tiie donor DNA. For example, the ei^ty-two ORFs fiiom the 
GeneStonn™ clone collection (bvitrogen Corporation, Garlsbad, GA) were amplified 
using adsqpted primers containing a conq)lementary nucleotide sequence^ Amplification 
of the 82 GeneStorm™ ORFs with the described modified primer pairs resulted in PGR 

20 products that had the complemmtary nucleotide sequence at their 5' end. This ORF PGR 
product is combined with 10 ng of topoisomerase-charged ds nucleic acid cloning vector 
ofthe present invention in either sterile water or a salt solution. The reaction is mixed 
gentiy and incubated for 5 minutes at room temperature (22-23^G). After five minutes, 
we placed the reaction on ice tiien proceeded to the One Shot® Gh^cal Transformation 

25 or Electroporation, (Invitrogen Gorporation, Garlsbad, GA, Gatalogue # G4040-10 and 
G4040-50, respectively), {Invitrogen TOPOGoning Protocol Invitrogen Gorp.), 
Topoisomerase had joined the adjacent strands ofthe vector and the product by catalyzing 
a rejoining reaction (Figure 17). DNA fiagm^ts constructed with the complemmtary 
nucleotide sequence at their 5* ends were thus cotrectiy inserted into topoisomerase- 

30 charged ds nucleic acid cloning vectors of the present invention with a high efficiency. 
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Directional ins^on of DNA fragmmts containing 5* sequences conq>lemeDtary 
into ds nucleic acid cloning vectors according to certain raibodimmts of flie present 
invention occurs witb greater than 90% efficiency as shown by sequencing multiple 
colonies of transformed host cells. In the current example^ the topoisomerase-charged ds 
S nucleic acid cloning vectors of the preset invention containing the GeneStonn™ OBFs 
were incubated witib transformation competent E. coli host cells. In seventy-four of the 
transformation reactions, the directional cloning of the ORFs into the topoisomeras&- 
charged ds nucleic acid cloning vector of the present invention occurred in at least sevm 
of the eight colonies picked, and fifty-nine of these cloning reactions were directional in 
10 all eight colonies picked. The overall directional cloning score was 609 of 656, thus, 
directional insertion was present in over 93% of tfie clones picked (see Table 1 below). 

EXAMPLE 4 

In a similar example, using the above described modified pCR®2.1 
topoisomerase-charged ds nucleic acid vector of the present invention, a PCR-generated 

IS ORF encoding Green Fluorescent Protem (GFP) was directionally cloned in fiame with 
the /acZ a firagment present in the vector (sr^ FIG. 4). The primers used to amplify the 
GFP gene contained the requisite complementary nucleotide sequence S -ATTC-3', and 
the known sequence for translation initiating methionine, 5 -ATG-3'. Using the necessary 
cloning stq)s noted above, tfie PC3R amplified GFP was inserted mto the vector and 

20 transformed cells were grown on solid Agar plates. Glowing colonies represented a 
correctiy inserted TfCSi product (see Table 2 below). 

These data represent a substantial improvement over the currmt state of the art in 
cloning, and fiirthennore present an invention in cloning that is highly compatible with 
hig^ tfarougjhput techniques. Given directional cloning efficiencies greater that 90%, a 
25 user need only screen two colonies for each cloned DNA firagment Thus, pn a 96-well 
plate, forty-eight separate clones can be screened for duectional insertion, 400% more 
than curmt cloning techniques. Use of this invention will streamline many high 
througjqput ^e eqxression operations, and allow them to run at fi:action of their current 
costs. 



30 
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. Table 1. Directional Cloning of ORFs using a topoisomerase-eharged ds nucleic acid 



cloning vector of the present invention 

Posifive colonies. dPCR reasons Clpnei^tested 

8/8 . 59 

7/8 15 

6/8 .2 

5/8 1 

4/8 3 

3/8 2 

5 Table 2. In frame and directional insertion of GFP into modified pCR2.1 
topoisomerase-eharged ds nucleic acid cloning vector of tibe present invention 

PGR producf s 5' sequence Percentage of concept Total vMte colonies 

insi^. (contain aiecbnibinant 

plasmid) 

5*-ATTCATG-3' homologous 86% 457 

S'-CAAGATG-3' non- 35% 118 
homologous 

5'-ATTCGGATG-3' frame shift . 0% 268 

VECTOR ONLY 0% 31 



Altfaou^ the invention has been described wifli rcference to the above examples, it 
10 will be understood that modifications and variations are oacompassed within the spirit and 
scope of the invenlioa Accordingly, flie inveaition is limited only by the following claims. 
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What is claimed is: 

1. A method for generating a directionally linked recombinant nucleic acid 
molecule, the method comprising contacting: 

a) a topoisomerase-charged first double stranded (ds) nucleic acid 

5 molecule, comprising a first topoisomerase covalently bound at or near a first end, 

and a second topoisomerase covalently bound at or near a second end, said fiist 
end further comprising a first 5* overhang, and said second end fiirther comprising 
a blunt end, a 3* thymidine overhang, or a second 5' overiiang; and 

b) a second ds nucleic acid molecule, comprising a first blunt end and a 
10 second end, wherein the first blunt end comprises at its 5' temiinus, a nucleotide 

sequence complementary to the first 5' overiiang, 

under conditions such that die nucleotide sequence conq)lementary to the first 
5' overhang can selectively hybridize to the first 5* overiiang, 

whereby the first topoisomerase can covalentiy link the 3' terminus of the first end 
IS of the first ds nucleic add molecule with the 5* terminus of the first end of the second ds 
nucleic acid molecule, and 

whereby the second topoisomerase can covalentiy link the 3* terminus of the 
second end of the first nucleic acid molecule to the 5' terminus of the second end of the 
second ds nucleic acid molecule, tiier^y gmerating a directionally linked nucleic acid 
20 molecule. 



2. The method of claim 1, wherein the second end of the first ds nucleic add 
molecule com^mses a blunt end, and die second end, of the second ds nucldc acid 
molecule conqprises a blunt end. 

25 

3. The method of claim 1, wherein the second end of the topoisomerase-chaiged 
first ds nucleic acid molecule comprises a 3' thymidhie overiiang, and the second end of 
(he seccmd ds nucldc add molecule comprises a 3' adenosine ovohang. 



30 
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4. The me&od of claim 1, wherein Ihe topoisomerase-charged, first ds nucldc 
acid molecule comprises a second S' oveihang at the second end, and the second 

ds nucleic acid comprising at flie second end, a nucleotide sequence conQ)lementaiy to the 
second 5' overhang. 

5 

5. The method of claim 1, wherein the first ds nucleic acid molecule is a vector. 

6. The method of S, wherein the topoisomerase-charged first ds nucleic acid 
molecule is a cloning vector. 

10 

7. The method of 6, i;^erein the topoisomerase-charged first ds nucleic acid 
molecule is an expression vector. 

8. The method of claim 1, further comprising introducing the directionaUy-liiiked 
1 S recombinant nucleic acid molecule into a cell. 

9. The method of claim 8, wherein the cell is a eukaryotic cell. 

10. Themethodof claim 9, wherein die cell is a mammalian cell. 

20 

11. A cell produced by the methods of claim 8. 

12. A transgenic non-human organism gmerated fix>m the cell of claim 11. 

25 13. The mediodofclaim 8, wherein the cell is a bacterium. 

. 14. The method of claim 1, wherein the second ds nucldc acid molecule 
comprises an amplification product. 

30 IS. The method of claim 1, whoiein the topoisomerase is a type IB 

topoisomerase. 
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16. The method of claim 1, wherein flie second ds nucleic acid molecule 
comprises one of a plurality of second ds nucleic add molecules. 

17. The method of claim 16, wherein second ds nucldc acid molecules in the 
S plurality are dififerentfiom each o&er. 

18. The method of claim 16, wh^in said plurality of second ds nucleotide 
molecules comprises a cDNA library. 

10 19. The method of claim 1 6, wherein said plurality of second ds nucleotide 

molecules comprises a combinatorial library. 

20. A recombinant nucleic acid molecule produced by the method of claim 1 . 

15 21 . A method for generating a directionally linked recombinant nucleic acid 

molecule, the method comprising contacting: 

a) a first precursor double stranded (ds) nucleic acid molecule comprising 
a first end, which comprises at the 5' terminus, a first 5' target sequence, 

and at the 3* terminus, a topoisomerase recognition site; and 
20 a second md which comprises at the 3' terminus, a topoisomerase 

recognition site; 

b) a second ds nucleic acid molecule comprising a first blunt end and a 
second end, wherein the first blunt end comprises at the 5 ' terminus a nucleotide 
sequence conq)lementary to die 5* target sequence; and 

25 c) a topoisomerase specific for the topoisomeiase recognition site, 

undo: conditions that allow topoisoma:ase activity, and that allow hybridization of 
the first 5' target sequence and the nucleotide sequence conqplementaiy to the target 
sequence, fiiereby generating a dkectionally Imked reccmibinant nucleic add molecule. 

30 22. The mediodofclaim 21, ^ei:ein the second end of the fii^ 

nucldc acid becomes a blunt end iq>on cleavage by the topoisomerase, and the second 
end of the second ds nucldc add molecule is a blunt end 
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23. The method of claim 21, wherein the second end of the fiist precursor ds 
nucleic acid molecule conqmses a 3' fh)pmidme extension upon cleavage by the 
topoisomwase, and the second aid of the second ds nucleic acid molecule comprises a 

5 3' adenosine overhang. 

24. The method of claun 21, wherem flie first precursor ds nucleic acid molecule 
conq)rises a second 5* target sequence located at the second end and the second ds nucleic 
acid molecule comprises at the second end, a nucleic acid sequence complementary to the 

10 second S* target sequence. 

25. The method of claim 24, wherem the first precursor ds nucleic add molecule 
is a vector. 

26. The method of claim 25, wherein the vector is an expression vector. 

27. The method of claim 21, fiirther comprising introducing the directionally- 
linked recombinant nucleic acid molecule into a cell 

28. The method of claim 21, wherein the first precursor ds nucleic acid molecule 
comprises an expression control element and the second ds nucleic add molecule 
conqjrises an open reading frame, wherem in the dkectionally Imked recombinant nucleic 
acid molecule, the expression control element is operatively liiiked to the open reading 
frame. 

29. The method of claim 21, wherem the second ds nucleic add molecule 
camprisGs an aziq>Iification product 

30. llie method ofclaun 21, wherem the topoisomerase is a type IB 
topoisomerase. 
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31. The melhod of claim 21, wherein the second ds nucleic acid molecule 
comprises one of a phurality of second ds nucleic acid molecules. 

32. The method of claim 31, wherein said plurality of second ds nucleotide 
molecules conqirises a cDNA library. 

33. A recombuiant nucleic acid molecule produced by the meOiod of claim 21. 

34. A method for generating a directionally linked recombinant nucleic acid 
molecule, the method comprising contacting: 

a) a topoisomerase-charged first double stranded (ds) nucleic acid 
molecule, comprising a first topoisomerase covalently bound to the 3' terminus of 
a first end of the ds nucleic acid molecule, said first end fiirther comprising a first 
5' overhang; and 

b) a second ds nucleic acid molecule, comprising a first blunt end and a 
second aid, wherein the first blunt end comprises a 5' nucleotide sequence 
complementary to the first 5' overiiang, 

under conditions such that the 5' nucleotide sequence of the first blunt end can 
selectively hybridize to the first 5' overhang, 

whereby the first topoisomerase can covalently link the 3' terminus of the first end 
of the first ds nucleic acid molecule with the 5' tominus of the first end of the second ds 
nucleic acid molecule. 
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35. The method of claim 34, further conqirismg contacting the topoisomerase- 
charged first ds nucleic acid molecule and the second ds nucleic acid molecule witfi a 
third ds nucleic add molecule, 

wha:ein a first end of the third nucleic ds acid molecule comprises a 5' overhang 
5 and a second topoisomerase covalmtly bound at the 3' tmninus, and 

wherdn the second ds nucleic acid molecule further conq>rises a second blunt end, 
^ch comprises a 5' nucleotide sequence conq>lementary to the second 5* ovodhang, and 

wherein the contacting is performed under conditions such that the 5' nucleotide 
sequence of die second blunt end of the second ds nucleic acid can selectively hybridize 
10 to the 5* overhang of the first end of the third ds nucleic acid molecule, 

whereby the second topoisomerase can covalently link the 3' terminus of the first 
end of the third ds nucleic acid molecule with the 5' terminus of the second blunt end of 
the second ds nucleic acid molecule. 

15 36. The method of claim 35, wherein the first ds nucleic acid molecule is 

directionally Unked to the second ds nucleic acid molecule and, thereafter, the third ds 
nucleic acid molecule is directionally linked to the second ds nucleic acid molecule. 

37. The method of claim 34, wh^ein the first ds nucleic acid molecule is 
20 operatively linked to die second ds nucldc acid molecule. 

38. The method of claim 34, wherehi the first ds nucleic acid comprises an 
expression control elemmt and die second ds nucleic acid conqirises an open reading 
fiame. 



25 
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39. The mettiod of claim 35, whCTein the first ds nucleic acid molecule comprises 
an expression control element, the second ds nucleic acid molecule comprises an open 
reading fiame, and the third ds nucleic acid molecule encodes a pqitide, 

whereui, in fhe directionally linked recombinant nucleic add molecule, the 
5 expression control elonent is operatively Imked to the open reading fiame, and the 
second ds nucleic acid molecule is operatively linked to fhe third ds nucleic acid 
molecule, and 

wherein the second ds nucleic acid molecule is operatively linked to the third ds 
nucleic acid molecule encode a fusion protein comprising the open reading fiame and the 
10 peptide. 

40. The method ofclaim 39, wherein the peptide comprises a tag. 

41. An isolated double stranded (ds) nucleic acid molecule, comprising a first 
15 topoisomerase covalently bound at a 3' temiinus of a first end, and a second 

topoisomerase covalently bound at a 3' terminus of a second end, said first end fiirther 
comprising a first 5' overhang and said second end fialher comprismg a blunt end, a 
3* thymidine overhang, or a second 5' oveihang, wherein said first 5* overhang is diflferent 
fiom said second 5' overhang. 

20 

42. The ds nucleic acid molecule ofclaim 41, wherein the second end comprises 
a blunt end. 

43. The ds nucleic acid molecule ofclaim 41, whorem the second end comprises 
25 a single 3' thymidine overhang. 

44. The ds nucleic add molecule of claim 41, wherem the second end comprises 
a second 5' oveihang. 

30 45. The dsnucldc acid molecule ofclaim 41, wherem the firsts* overhang 

comprises the nucleotide sequence 5 -GGTG-3*. 
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46. The ds nucleic add molecule of claim 41 , wherein flie ds nucleic acid 
molecule is a vector. 

47. The ds nucleic acid molecule of claim 46, wherein the vector fiirflier 
5 comprises a recombinase site. 

48. The ds nucleic acid molecule of claim 47, wh^^ the vector fiirther 
comprises a lox site. 

10 49. The ds nucleic acid molecule of claim 46, wherein the ds nucleic acid 

molecule is a cloning vector. 

50. The ds nucldc acid molecule of claim 49, wh»:ein the ds nucleic acid 
molecule is an expression vector. 

15 

51. The ds nucleic acid molecule ofclaim 48, wherein the first end and the * 
second end are adjacent sequ^ces of a nucleotide sequence encoding a selectable marker. 

52. The ds nucleic add molecule of claim 46, wherein the vector is a 
20 pUniA^5-ffis version A vector (SEQ ID N0:16). 

53. The ds nucldc acid molecule ofclaim 52, wherein ttie 5' overhang of the first 
end comprises 5'-GGTG-3', and wherem the second end is a blunt end, which conq)rises a 
topoisomerase recognition site at the 3' temunus. 

25 

54. The ds nucldc add molecule ofclaim 46, n^erdn the vector is a pCR®2. 1 

vector; 

55. The ds nucldc add molecule of claim 54, \idierdn the 5' overhang of the first 
30 end comprises 5 -GAAT-3*, and wfaerdn the second miA is a blunt end, whidi comprises a 

topoisomerase recognition site at the 3' terminus. 
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56. A compositioiiy comprising: 

a) a first ds nucldc acid molecule con^ristng a first end and a second end, 
wherein the first end comprises a 5' overiiang and a topoisomerase covalently 
bound at the 3' terminus, and 
5 b) a second ds nucleic add molecule conq)ris]ng a first blunt end and a 

second end, wherein the first blunt end comprises a fia:st S* nucleotide sequence, 
which is complCTientary to the first 5**ovajiang, and a first 3* nucleotide sequence 
conq>lementary to the first 5' nucleotide sequence. 

10 57. The conq>osition of claim 56, wherein the first 5' nucleotide sequence of the 

first blunt end of the second ds nucleic acid molecule is hybridized to the first 5' overhang 
of the first end of the first nucldc acid molecule, and the first 3' nucleotide sequence of 
the first blunt end of the second ds nucldc acid molecule is displaced. 

15 58. The composition of claim 56, wiierein the first ds nucldc acid molecule 

further comprises a second 5' overiiang at the second end, 

wherein the second end of the second ds nucleic acid molecule further comprises a 
second 5' nucleotide sequence, which is complementary to the second 5* ovahang, and a 
second 3* nucleotide sequence complemmtary to the second 5* nucleotide sequence. 

20 

59. A kit containing the ds nucleic acid molecule of claim 1. 

60. The kit of claim 59, wherem the nucldc add molecule is a vector. 



25 



61 . The kit of claim 59, fiirtti^ conqxrising an expression control element 
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62. A kit, comprising 

a) a first double stranded (ds) nucldc add molectde, vMch comprises a 
first topoisomerase covalently bound at a 3' terminus of a fiist end, and a second 
topoisomerase covalently bound at a 3' terminus of a second end, 

5 said first end fiirtfa^ comprising a first 5 ' ovofaang and said second end 

further compri^g abhmt end, a 3' thymidine oveiliang, or a second 5' overhang, 
herein said first 5' overhang is different fix>m said second S' oveifaang; and 

b) a plurality of second ds nucleic acid molecules, wherein each ds nucleic 
add molecule in the plurality comprises a first blunt end, and wherein fihe first 

10 blunt CTd con^rises a 5* nucleotide sequence complementary to the first 

5' overhang of the first ds nucldc acid molecule. 

63. The kit of claim 62, wherein the second ds nucldc acid molecules in the 
plurality comprise transcriptional regulatory elements, translational regulatory elements, 

15 or a combination thereof. 

64. The kit of claim 62, wherein the second ds nucleic acid molecules in the 
plurality conaprise nucleotide sequences encoding a peptide. 
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MULTIPLE CLONING SUE 

I 

Unil Fofwantpfiningaite 
GA6CTTA6TA OGTACTATCA ACAG6TTGAA CTGCTGATCA ACAGATCCTC 



TACX3CGGCCG CGCTACC ATA ACT TCG TAT AGC ATA CAT TAT ACG 



AAG TTA TCG GAG6AAT TGGC^TCGAGG AATTCAfccGG TGCCGTG^^GG 



GCGGATCCGG GCCOGACGTC AGGCc JcCTt' CGC^ G CTC 0(31 AAG CCT 

Gly Lys Pro 

V5 epitope 

- ■ - — I I — 

ATC CCT AAC CCT CTC CTC GGT CTC GAT TCT AGC CAT CAT 
lie Pro Asn Pro Leu Leu Gly Leu Asp Ser Ser His His 



fixHbtag Unil Reverse piimings 



CAC CAT CAC CAT TGA AGCTQGCTA TCAGCCTCGA CTGTGCCTTC 
His His His His *** 



TAGTTGCCAG CCATCXGTTG TTTGCCCCTC CCCCGTGCCT 



pUniAA5-His A,B,C^ 

2.3 kb 



FIGURES 
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. MULTIPLE CLONING SITE 



lacZaATG 
M13 Rerverse Primer | 



CA6 G^A ACA GOT ATG 
GTC CTT TGT CGA TAG 



a5 

33 



C ATG ATT AOG CCA AGC TTG GTA COG AGD TOG GAT CCA CTA 
GTAC TAA TGC GOT TOG AAC CAT OGC TCG AOG CTA GGT GAT 



BsfXI EcoRI 

GTA ACG GCX^ GOC AGT GTG^G GIVA TTC GGC 
CAT TOO OGG OGQ TCA CAG GAC CTT AAG COG 



GnBI 

GOC GAA TTC TOG 
TT OGG CTT AAG AOG 



Avat 
PaeR7l 

6coRV BsOCI Noll Xhot NsilXbai Apa\ 

WW tJvT CCA TCA GAC TGG OGG COG GTC CAG CAT GCa'tCT AGA QGG GOC AAT TOG 
TCr ATA GGT AGT GTG AOG GOG GQC GAG GTC GTA GGT ACA TGT COO QGG TTA AGO 



COG TAT 
GC3B ATA 
^ 



T7 Promoter 



AGT &VG TOG TAT 
TCA CTC AGC ATA 



AT|< 



M13 Focwaitr (>20) Primer 
GTG GOC GTG GFT TTA^aA 



M13 Fomard (-40) Primer 
^T GGT CAG TGS GAA AAG| 



1 




FIGURE 4 
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' Sequence of pUniA^S-His version A 
AATTTXXTATCrrCAGKXXnTAAGTXnT^^ 

AcnxKxnTACATCXxnxKKrntmtnxxACAA 

TirrrrnxnTATAAAACrrAAAACClTAGAG^ 

AAACATCAGAGCTTAGTACGTCAAACATCAGAGCITACTAC^^ 

ATGAGGGTrrAGTrcCTTAAACATGAGAGCITAGTAOTTrAAAC^ 

GAGCTTAGTACGTACnTaXIAAaVGGTTGAACIXK^ 

Aaran^ATAGCATACATTATAaJAAGTrAT^^ 

G<XK5ATO^3e§GC^^ 

CGATfef^G&AlCATC^ 

OCAlCIO-ilUiilGCOCCTOCOCCXnXXX:^^ 

AATAAAATCAGGAAATIGCATCGCATIXnCXGAGT^ 

GGACAGCAAGGGGGAGOATTGGGAAGACAAtAGCAGGCATGClX^^ 

AOWWVGOCXXJAAAGGAAGCTCAGTOKKnXKri^^ 

CXITCTAAAaKKmnTCAGGGGTTr^^ 

AGAACItXrAGCATGAGATCXXXGOGCIGGAGGATCAt^^ 

<XAA(XTITCATAGAAGGCGGCq€ntK}AATO 

CATnOGAA(XOCAGAGTPCCCG€TCAGAAGAACTCGT^^ 

TCGGGAGOGGCGATACCGTA>^GCACXjAGGAAGCGGTCAG<X^ 

TC^VCGGGTAGCXrAACGCrATOTCCT^^ 

AAAAGOGGCCATITIXXACCATGATATIlgXKj^^ 

GTXXKKK:ATGCGaKXnTGAGCClXKKX5AACAGT^^ 

tcatxxtoatcgacaagacxx?gctnxatd& 

gaatcggcaggtagccggatc/^g<x3tatgcagoogcx:gc^^ 

gcaggagcaaggtcagatcacaggagatcxrnxxxxxkk^ 

GCTrcAGTCACAACXnx:GAGCACAGCIXKX^ 

GCXnXXnXXTXK:AGTrcATrcAGGGCAOOGGACA(K^^ 

CHGACAGCOGGAACAOGGOGGCATCAGAGCAGOCGAITC^ 

CIXXACTCAAGCGGOCGGAGAAannGCXnXKV^^ 

GTCICrroATCAGATCTTCATCXXXI^^ 

GCAGGGCTltXCAACCrrACCAGAGGGCGCC^ 

gcccagtctagctatcg(x:atgtaagcccaci^ 
gtccagatagoocagtagcroacattcatxxg 

CCGCTiraTTAGOVGCXCntK^^ 



FIGURES 
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Add EcoRI and Sad 
digestion atzymes 



5'-G 

3*-crrAA 

x^esuiting Gohesive end 
post EcoRI cEgest 



TOGAC-5'j 

Resulting 
cc^esive end post 
Sad digest 



nGURB6 
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5'-XXXXXXXXXXXXX?pCXXCCCTT XXXXXXXX-3* 
3'-xxxxxxxxxxxxxxxxxQGGAA xxxxxxxx-5* 

"^0 



ADD TOPOISOMERASE 

V 




TOPQ/ 



^Leaviiig group 

S'ODCXXXXJDDODDOKXXXC^ XXXXXXXX-3' 



5AA xxxxxxxx-5* 

© 



FIGUBE8 
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5'-xxxxxxxxxxxxxxxxxccxnT 

3*-xxxxxxxxxxxxxxxxxGGGAA 



XXXXXXXX-3' 
xxxxxxxx-5' 



ADD TOPOISOMERASE 




5*-XXXXXXXXXXJDCXXXXXCCCTT 
3*-xxxxxxxxxxxxxxxxxGGGAA. 



lESssodated leaiong group 



XXXXXXXX-3' 

XXXXXXXX-S* 



FIGURE 9 
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Sequence of pCR 2.1 

AGCGCXX:7WVTACXK:AAACCGCXrnriXXX^^ 

TCXXXjACTGGAAAGCGGGCAGTGAGa5CAAL(^ 

GCTTTACACITrATXKnTCCGGC^^ 

ACAGCTATGACXlATCATrAOGCXIAAGCra 

TGGAATrOGGCnTAA€KXXJAATTCItK:AGA^^ 

GGCXXAATTOGCCCTATACnXJACnOGTATrAC^^ 

AAA(X)CTOGCXyrrAO(X:AACTrAATX:^^ 

6AGOCOCGCACXXIATCXKXXrnXXX:AACACr^ 

GC!(K:ArrAAGCXK:GGCGGGTGTGGTXKnTAC^^ 

CGCTXxrrrroGcrrrcrixxxT^ 

GG<Xrn:XXrmAGGOTTXXGATrTAGAGCm 

gttcacgtagtxkkkx:atcxkxxjix3at^^ 
TA(nxK>ACixjnxjmx:AAAcr^ 

ATITItKXXJATTTOGGCCTATIW^ 

AAGOAAOCXK5AACACGTAGAAAGCX:AGTCaK:AGAAACCKnV^^ 

GCrATCriGGACAAGGGAAAACX3CAAGOGCAAAGAGAAAGCAGGT^^ 

AGCTAGACTCGGCGGTTTrATGGACAGCAAGCOAAC^^ 

TGGGAAGCCCTtK:AAAGTAAACTGGATCGCIT^ 

TCTIGATCAAOAGACAGGATGAGOATCGTnXXX:^^ 

(XXK7nXKK7rGGAOAGGCrATIX33GCrATC 

GTItXG<KnGTCAGCGCAGGGGCGCXXXK/llU 

CIXK:AGGAC»AGGCAGCGa3GCTATXXri^^ 

TIXnnCACTOAAG<:XKK5AAGGGAClX3GCrGCT . 

(XrrrocnXnXKXX?AGAAAGTATXXATC 

A(XrrCKX:CATrTOACCACX::AAGCX5A>^^ 

TCXJATCAGGATGATCTGGACXJAAGAGCATCAGGGGCTXX^ 

GCGCMDGCCXXJACXSGOGAGGAlXnTXnXOTGAT^ 

AATOGCXXKnrrnriXK^ATTCAACGACTXmXKXX 

TCKJATAOCXXjTGATATIXXnXJAAGAGCXrGGCQG^ 

CXKX3GCnCOCGATIXXKrAGCXK:ATCXKX^ 

AGAGTATGAGTATTCAACATITCXXJroTOGOCXriT 

lXKnx:AO0CAGAAAOGClXKnX5AAAGT^^ 

GAACroGATCTCAACAGOGGTAAGATra 

CTITTAAAGTICIGCTATtntrATAC^ 

GGOOCXjCTTATICrcAGAATOAU^ 

ACAGTAAGAOAATrAlXK:AGTtXnGOCAT^ 

aJAlXXKJAGGACOTAAGGAGCTAAOCGCTriT^^ 

TrCKK;AACXXK3AGCIOAATGAAGCX:ATAOCAAAOaAOT 

ACAAOTrrGCXKIAAACTATrAACIGGro 

TXX5AGG0GOATAAAGTnK:AGOACrACTrcri^^ 

ATtnXKJAGCXXKyroAGCXmKKSTCnXXMX^ 

AIXXnAGTTATCTACAOGACGGGGAGTtAGGC^ 

GT^KXTKACroATTAAGCATIXKn^AACI^^ 

ACrrcATTTITAATITAAAAGGATCTAGGTCAAGA 

CXnGAGTTnXOTTCCACTCAGCGT^ 

AGAGCTAOCAACTXriTmxXGAAGGTAACIGC^^ 

OTOTAGCOGTAGTTAGGCCACCACnTCAAGAACT^^ 

TGTTAOCAGTCGCTGCKKXAGTCGCGA 

GGATAAGGCGCAGCGGTOOGGCTOAAOGGGGGGTItOT^ 

ACXXJAACTOAGATACCTACAGCXnOAGCATroAGAAAGCXK^ 

GGTATOXHTTAAGOGGCAGGGTCXfGAAC^^ 

TXHTTATAGTCCIXnOKKn^ 

CXKJAGCCTATCOAAAAAOGCXZAGCAAOaC^^ • 

ACATtmxnrixxnxKxmva^ 

CGClCXXXXK:AGCOGAAa>AOC»AGCXK:AGCX}AC^ 



FIGURE 12 
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FIGUSB13 
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TOPO 



CCCTT 
GGGAATAAG 



M13 
reverse 



Z 



AAGGG 
JTOCC 



TOPO 



T7 

promoter 
=31 



OBI 




forward 



LINEARIZED, TOPO 
CHARGED, FLAP 
VECTOIL 

Modified pCR 2.1 



M13 
forward 



PIORI 



Resistance 



Kanamydn 
resistaace 



FIGURE 15 
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SBQTTENCE LISTING , 
<110> INVITROGEN CORPORATION 
CHBSTNUT, John 
SHUMAN, Stewcurt 
HEYKAN, John 
MADDEN, IQout 
BENNETT, Rob 

<120> METHODS AND REAGENTS FOR MOLECULAR CLONING 

<130> INVIT1300WO 

<150> US 60/226,563 
<151> 2000-08-21 

<160> 18 

<170> Patentin version 3.0 

<210> 1 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Vaccinia tppoisomerase cleavable sequence 

<400> 1 
gcccttattc cc 



<210> 2 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Vaccinia topoisomerase cleavable sequence 

<400> 2 
tcgcccttat tc 



<210> 3 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Vaccinia tppoisotnerase cleavable seqpience 

<400> 3 
tgtcgccctt at 



<210> 4 

<211> 12 

<2X2> DNA 

<213> Airtificial Bec[uence 



<220> 
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<223> Vaccinia tqpoisomerase cleavable sequence 

<400> 4 
gtgtcgccct ta 



<210> 5 

<211> 28 

<2i2> vm. 

<213> Artificial sequence 
<220> 

<223> Adapter oligonucleotide 

<400> 5 

aattgatccc ttcaccgaca tagtacag 



<210> 6 

<211> 12 

<212> DMA 

<213> Artificial sequence 
<220> 

<223> Adapter oligonucleotide 

<400> 6 
ggtgaaggga tc 



<210> 7 

<211> 11 

<212> DNA 

<213> Artificial 8eq[uence 
<220> 

<223> Adapter oligonucleotide 

<400> 7 
aagggcgagc t 



<210> 8 

<211> 19 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Adapter oligonucleotide 

<400> 8 

cgcccttgac atagtacag 



<210> 9 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Intermediate vector overhang sequence 
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<400> 9 
gacatagtac ag 



12 



<210> 10 

<211> 15 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Annealing oligonucleotide 

<400> 10 

caactgtact atgtc 15 



<210> 11 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Adopter oligonucleotide 



<210> 12 

<211> 11 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Adopter oligonucleotide 

<400> 12 

gaataagggc g 11 



<210> 13 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Adopter oligonucleotide 



<210> 14 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Intermediate vector overhang sequence 



<400> 11 

agctcgccct tattccgata gtg 



23 



<400> 13 

aattcgccct tattccgata gtg 



23 



3 
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<400:^ 14 

attccgatag tg 12 



<210> 15 

<21l> 15 

<212> DKA 

<213> Artificial sequence 
<220> 

<223> Annealing oligonucleotide 

<400> 15 

caacactatc ggaat 15 



<210> 16 
<211> 2290 
<212> DNA 

<213> Artificial sequence 








<220> 

<223> pUni/V5-His version A vector 






<400> 16 
aattcccatg 


tcagccgtta 


agtgttcctg 


tgtcactcaa 


aattgctttg agaggctcta 


60 


agggcttctc 


agtgcgttac 


atccctggct 


tgttgtccac 


aaccgttaaa ccttaaaagc 


120 


tttaaaagcc 


ctauacatuc 




tataaaactt 


aaciaccttag aggctattta 


180 


agttgctgat 


ttatattaat 


tttattgttc 


aaacatgaga 


gcttagtacg tgaaacatga 


240 


gagcttagta 


cgttagccat 


gagagcttag 


tacgttagcc 


atgagggttt agttcgttaa 


300 


acatgagagc 


ttagtacgtt 


aaacatgaga 


gcttagtacg 


tgaaacatga gagcttagta 


360 


cgtactatca 


acaggttgaa 


ctgctgatca 


acagatcctc 


tacgcggccg cggtaccata 


420 


acttcgtata 


gcatacatta 


tacgaagtta 


tcggaggaat 


tggctcgagg aattcaccgg 


480 


tgccgtgtgg 


gcggatccgg 


gcccgacgtc 


aggcctcgat 


cggagctcgg taagcctatc 


540 


cctaaccctc 


tcctcggtct 


cgattctagc 


catcatcacc 


atcaccattg aagctcgcta 


600 


tcagcctcga 


ctgtgccttc 


tagttgccag 


ccatctgttg 


tttgcccctc ccccgtgcct 


660 


tccttgaccc 


tggaaggtgc 


cactcccact 


gtcctttcct 


aataaaatga ggaaattgca 


720 


tcgcattgtc 


tgagtaggtg 


tcattctatt 


ctggggggtg 


gggtggggca ggacagcaag 


780 


ggggaggatt 


gggaagacaa 


tagcaggcat 


gctggggatt 


ctagaagatc cggctgctaa 


840 


caaagcccga 


aaggaagctg 


agttggctgc 


tgccaccgct 


gagcaataac tagcataacc 


900 


ccttggggcc 


tctaaacggg 


tcttgagggg 


ttttttgctg 


aaaggaggaa ctatatccgg 


960 


atatcccggg 


gtgggcgaag 


aactccagca 


tgagatcccc 


gcgctggagg atcatccagc 


1020 


cggcgtcccg 


gaaaacgatt 


ccgaagccca 


acctttcata 


gaaggcggcg gtggaatcga 


1080 



4 
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aatctcgtga tggcaggttg ggcgtcgctt ggtcggtcat ttcgaacccc agagtcccgc 1140 

tcagaagaac tcgtcaagaa ggcgatagaa ggcgatgcgc tgcgaatcgg gagcggcgat 1200 

accgtaaagc acgaggaagc ggtcagccca ttcgccgcca agctcttcag caatatcacg 1260 

ggtagccaac gctatgtcct gatagcggtc cgccacaccc agccggccac agtcgatgaa 1320 

tccagaaaag cggccatttt ccaccatgat attcggcaag caggcatcgc catgtgtcac 1380 

gacgagatcc tcgccgtcgg gcatgcgcgc cttgagcctg gcgaacagtt cggctggcgc 1440 

gagcccctga tgctcttcgt ccagatcatc ctgatcgaca agaccggctt ccatccgagt 1500 

acgtgctcgc tcgatgcgat gtttcgcttg gtggtcgaat gggcaggtag ccggatcaag 1560 

cgtatgcagc cgccgcattg catcagccat gatggatact ttctcggcag gagcaaggtg 1620 

agatgacagg agatcctgcc ccggcacttc gcccaatagc agccagtccc ttcccgcttc 1680 

agtgacaacg tcgagcacag ctgcgcaagg aacgcccgtc gtggccagcc acgatagccg 1740 

cgctgcctcg tcctgcagtt cattcagggc accggacagg tcggtcttga ceiaaaagaac 1800 

cgggcgcccc tgcgctgaca gccggaacac ggcggcatca gagcagccga ttgtctgttg 1860 

tgcccagtca tagccgaata gcctctccac ccaagcggcc ggagaacctg cgtgcaatcc 1920 

atcttgttca atcatgcgaa acgatcctca tcctgtctct tgatcagatc ttgatcccct 1980 

gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 2040 

cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 2100 

gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 2160 

ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 2220 

actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 2280 

gcgtgaagct 2290 

<210> 17 
<211> 3906 
<212> WA 

<213> Artificial secpience 
<220> 

<223> PCR2.1 vector 
<400> 17 

agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 60 
acgacaggtt tcccgactgg aaiagcgggca gtgagcgcaa cgcaattaat gtgagttagc 120 

tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 180 

ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac gccaagcttg 240 

gtaccgagct cggatccact agtaacggcc gccagtgtgc tggaattcgg cttaagccga 300 
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attctgcaga 


tatccatcac 


actggcggcc 


gctcgagcat 


gcatctagag 


ggcccaattc 


360 


gccctatagt 


gagtcgtatt 


acaattcact 


ggccgtcgtt 


ttacaacgtc 


gtgactggga 


420 


aaaccctggc 


gttacccaac 


ttaatcgcct 


tgcagcacat 


ccccctttcg 


ccagctggcg 


480 


taatagcgaa 


gaggcccgca 


ccgatcgccc 


ttcccaacag 


ttgcgcagcc 


tgaatggcga 


540 


atgggacgcg 


ccctgtagcg 


gcgcattaag 


cgcggcgggt 


gtggtggtta 


cgcgcagcgt 


600 


gaccgctaca 


cttgccagcg 


ccctagcgcc 


cgctcctttc 


gctttcttcc 


cttcctttct 


660 


cgccacgttc 


gccggctttc 


cccgtcaagc 


tctaaatcgg 


gggctccctt 


tagggttccg 


720 


atttagagct 


ttacggcacc 


tcgaccgcaa 


aaaacttgat 


ttgggtgatg 


gttcacgtag 


780 


tgggccatcg 


CGCtgataga 


cggtttttcg 


ccctttgacg 


ttggagtcca 


cgttctttaa 


840 


tagtggactc 


ttgttccaaa 


ctggaacaac 


actcaaccct 


atcgcggtct 


attcttttga 


900 


tttataaggg 


attttgccga 


tttcggccta 


ttggttaaaa 


aatgagctga 


tttaacaaat 


960 


tcagggcgca 


agggctgcta 


aaggaaccgg 


aacacgtaga 


aagccagtcc 


gcagaaacgg 


1020 


tgctgacccc 


ggatgaatgt 


cagctactgg 


gctatctgga 


caagggaaaa 


cgcaagcgca 


1080 


aagagaaagc 


aggtagcttg 


cagtgggctt 


acatggcgat 


agctagactg 


ggcggtttta 


1140 


tggacagcaa 


gcgaaccgga 


attgccagct 


ggggcgccct 


ctggtaaggt 


tgggaagccc 


1200 


tgcaaagtaa 


actggatggc 


tttcttgccg 


ccaaggatct 


gatggcgcag 


gggatcaaga 


1260 


tctgatcaag 


agacaggatg 


aggatcgttt 


cgcatgattg 


aacaagatgg 


attgcacgca 


1320 


ggttctccgg 


ccgcttgggt 


ggagaggcta 


ttcggctatg 


actgggcaca 


acagacaatc 


1380 


ggctgctctg 


atgccgccgt 


gttccggctg 


tcagogcagg 


ggcgcccggt 


tctttttgtc 


1440 


aagaccgacc 


tgtccggtgc 


cctgaatgaa 


ctgcaggacg 


aggcagcgcg 


gctatcgtgg 


1500 


ctggccacga 


cgggcgttcc 


ttgcgcagct 


gtgctcgacg 


ttgtcactga 


agcgggaagg 


1560 


gactggctgc 


tattgggcga 


agtgccgggg 


caggatctcc 


tgtcatctcg 


ccttgctcct 


1620 


gccgagaaag 


tatccatcat 


ggctgatgca 


atgcggcggc 


tgcatacgct 


tgatccggct 


1680 


acctgcccat 


tcgaccacca 


agcgaaacat 


cgcatcgagc 


gagcacgtac 


tcggatggaa 


1740 


gccggtcttg 


tcgatcagga 


tgatctggac 


gaagagcatc 


aggggctcgc 


gccagccgaa 


1800 


ctgttcgcca 


ggctcaaggc 


gcgcatgccc 


gacggcgagg 


atctcgtcgt 


gatccatggc 


1860 


gatgcctgct 


tgccgaatat 


catggtggaa 


aatggccgct 


tttctggatt 


caacgactgt 


1920 


ggccggctgg 


gtgtggcgga 


ccgctatcag 


gacatagcgt 


tggatacccg 


tgatattgct 


1980 


gaagagcttg 


gcggcgaatg 


ggctgaccgc 


ttcctcgtgc 


tttacggtat 


cgccgctccc 


2040 


gattcgcagc 


gcatcgcctt 


ctatcgcctt 


cttgacgagt 


tcttctgaat 


tgaaaaagga 


2100 
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agagtatgag 


tattcaacat 


ttccgtgtcg 


cccttattcc 


cttttttgcg 


gcattttgcc 


2160 


ttcctgtttt 


tgctcaccca 


gaaacgctgg 


tgaaagtaaa 


agatgctgaa 


gatcagttgg 


2220 


gtgcacgagt 


gggttacatc 


gaactggatc 


tcaacagcgg 


taagatcctt 


gagagttttc 


2280 


gccccgaaga 


acgttttcca 


atgatgagca 


cttttaaagt 


tctgctatgt 


catacactat 


2340 


tatcccgtat 


tgacgccggg 


caagagcaac 


tcggtcgccg 


ggcgcggtat 


tctcagaatg 


2400 


acttggttga 


gtactcacca 


gtcacagaaa 


agcatcttac 


ggatggcatg 


acagtaagag 


2460 


aattatgcag 


tgctgccata 


accatgagtg 


ataacactgc 


ggccaactta 


cttctgacaa 


2520 


cgatcggagg 


accgaaggag 


ctaaccgctt 


ttttgcacaa 


catgggggat 


catgtaactc 


2580 


gccttgatcg 


ttgggaaccg 


gagctgaatg 


aagccatacc 


aaacgacgag 


agtgacacca 


2640 


cgatgcctgt 


agcaatgcca 


acaacgttgc 


gcaaactatt 


aactggcgaa 


ctacttactc 


2700 


tagcttcccg 


gcaacaatta 


atagactgga 


tggaggcgga 


taaagttgca 


ggaccacttc 


2760 


tgcgctCQQC 


ccttccggct 


ggctggttta 


ttgctgataa 


atctggagcc 


ggtgagcgtg 


2820 


ocrtcticcicciq 


tatcatbgca 


gcactggqgc 


cagatggtaa 


gccctcccgt 


atcgtagtta 


2880 


tctacacgac 


crggQaqtcag 


gcaactatgg 


atgaacgaaa 


tagacagatc 


gctgagatag 


2940 


gtgcctcact: 


gattaagcat 


tggtaactgt 


cagaccaagt 


ttactcatat 


atactttaga 


3000 


t;tgatttaaa 


acttcatttt 


taatttaaaa 


ggatctaggt 


gaagatcctt 


tttgataatc 


3060 


tcatgaccaa 


aatcccttaa 


cgtgagtttt 


cgttccactg 


agcgtcagac 


cccgtagaaa 


3120 


agatcaaagg 


atcttcttga 


gatccttttt 


ttctgcgcgt 


aatctgctgc 


ttgcaaacaa 


3180 


aaaaaccacc 


gctaccagcg 


gtggtttgtt 


tgccggatca 


agagctacca 


actctttttc 


3240 


cgaaggtaac 


tggcttcagc 


agagcgcaga 


taccaaatac 


tgtccttcta 


gtgtagccgt 


3300 


agttaggcca 


ccacntcaag 


aactctgtag 


caccgcctac 


atacctcgct 


ctgctaatcc 


3360 


tgttaccagt 


ggctgctgcc 


agtggcgata 


agtcgtgtct 


taccgggttg 


gactcaagac 


3420 


gatagttacc 


ggataaggcg 


cagcggtcgg 


gctgaacggg 


gggttcgtgc 


acacagccca 


3480 


gcttggagcg 


aacgacctac 


accgaactga 


gatacctaca 


gcgtgagcat 


tgagaaagcg 


3540 


ccacgcttcc 


cgaagggaga 


aaggcggaca 


ggtatccggt 


aagcggcagg 


gtcggaacag 


3600 


gagagcgcac 


gagggagctt 


ccagggggaa 


acgcctggta 


tctttatagt 


cctgtcgggt 


3660 


ttcgccacct 


ctgacttgag 


cgtcgatttt 


tgtgatgctc 


gtcagggggg 


cggagcctat 


3720 
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cttttactcio 


cctttt.gctc 


3780 


acatgttctt 


tcctgcgtta 


tcccctgatt 


ctgtggataa 


ccgtattacc 


gcctttgagt 


3840 


gagctgatac 


cgctcgccgc 


agccgaacga 


ccgagcgcag 


cgagtcagtg 


agcgaggaag 


3900 


cggaag 












3906 
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<210> 18 
<211> 310 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> ptiNl vector multiple cloning site 
<400> 18 

gagcttagta cgtactatca acaggttgaa ctgctgatca acagatcctc tacgcggccg 60 
cggtaccata acttcgtata gcatacatta tacgaagtta tcggaggaat tggctcgagg 120 
aattcaccgg tgccgtgtgg gcggatccgg gcccgacgtc aggcctcgat cggagctcgg ISO 
taagcctatc cctaaccctc tcctcggtct cgattctagc catcatcacc atcaccattg 240 
aagctcgcta tcagcctcga ctgtgccttc tagttgccag ccatctgttg tttgcccctc 300 
ccccgtgcct 
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